MulMoSenT: Multimodal sentiment analysis for a low-resource language using textual-visual cross-attention and fusion
Speech emotion recognition: A systematic mega-review of techniques and pipelines
ExInCOACH: Strategic exploration meets interactive tutoring for context-aware game onboarding
Fusion of quantum computing with smart agriculture: A systematic review of methods, implementation, applications, and challenges
Dual-layer prompt ensembles: Leveraging system- and user-level instructions for robust LLM-based query expansion and rank fusion
Lifting wavelet transform-guided network with histogram attention for liver segmentation in CT scans
A novel knowledge distillation method for graph neural networks with gradient mapping and fusion
GeoCraft: A Diffusion Model-based 3D Reconstruction Method driven by image and point cloud fusion
GC-Fed: Gradient centralized federated learning with partial client participation
Validity-aware context modeling for gradient-guided image inpainting
Tokenized EEG signals with large language models for epilepsy detection via multimodal information fusion
Integrating visual and audio cues for emotion and gender recognition: A multi modal and multi task approach
Shrinkage matters: evidence from accuracy-diversity trade-off in regression ensembles
FuseMeter: An efficient framework for generic per-flow traffic measurement
ViP-HMNN: a visual pathway-inspired hybrid neural network incorporated with in-memory computing for object recognition
ChatAssistDesign: A language-interactive framework for iterative vector floorplan generation via conditional diffusion
Vision-language models for person re-identification: a survey and outlook
Data augmentation with attentional feature aggregation for node classification in GNNs
A comprehensive survey and taxonomy of mamba: Applications, Challenges, and Future Directions
FedExIT - missing class-agnostic semi-supervised federated learning with extreme imbalance tackling scheme
Span-aware temporal aggregation network for video moment retrieval
ST-Imputer: Multivariate dependency-aware diffusion network with physics guidance for spatiotemporal imputation
IAENet: An importance-aware ensemble model for 3D point cloud-based anomaly detection
Parasite: planting durable backdoors against continual federated model fusion via discriminative feature pushing-pulling
Uncertainty-aware multi-view evidence fusion for feature selection in brain network analysis
Survey of uncertainty estimation in LLMs - Sources, methods, applications, and challenges
Attention-driven contrastive learning for cross-modal hashing with prototypical separation
EBMADDPG: Shapley-based explainable moving target defense for edge intelligence-enabled SIoT systems via joint Bayesian Markov games and DRL
Multimodal brain network analysis: Research advances and challenges
MFF-MTT: a multi-feature fusion-based deep learning algorithm for maneuvering target tracking
A comprehensive benchmark of spatial encoding methods for tabular data with deep neural networks
VLDBench Evaluating multimodal disinformation with regulatory alignment
Deep learning-based astronomical multimodal data fusion: A comprehensive review
Dimensional compensation for small-sample and small-size insulator burn mark via RGB-point cloud fusion in power grid inspection
TPIN: Text-based parallel interaction network with modality-common and modality-specific for multimodal sentiment analysis
Multi-source information fusion through tucker tensor decomposition-based transfer learning for handwriting-Based Alzheimer's disease detection
Hierarchical cross-module knowledge transfer based on structural multi-view least squares support vector classification
MoMD Transformer: adaptive multi-modal fault diagnosis via knowledge transfer with vibration-current signals
GCEPANet: A lightweight and efficient remote sensing image cloud removal network model for optical-SAR image fusion
Subgraph-focused biomedical knowledge embedding with bi-semantic encoder for multi-type drug-drug interaction prediction
Generating vision-language navigation instructions incorporated fine-grained alignment annotations
SynJAC: synthetic-data-driven joint-granular adaptation and calibration for domain specific scanned document key information extraction
Multi-modal and multi-condition fault diagnosis of rotating machinery via a heterogeneous graph learning framework
DepressInstruct: Instruction tuning of large speech-language models for depression detection
Rethink: reveal the impact of semantic distribution transfer from the cross-modal hashing perspective
Few-shot harmful meme detection via self-adaption mixture-of-experts
Scoping review of multimodal sentiment analysis and summarization: State of the art, challenges and future directions
Bridging the sim-to-real gap in RF localization with large-scale synthetic pretraining
Internet meme on social media: A comprehensive review and new perspectives
A hierarchical information policy fusion framework with multimodal large language models for autonomous guidewire navigation in endovascular procedures