Integrating visual and audio cues for emotion and gender recognition: A multi modal and multi task approach
Shrinkage matters: evidence from accuracy-diversity trade-off in regression ensembles
FuseMeter: An efficient framework for generic per-flow traffic measurement
ViP-HMNN: a visual pathway-inspired hybrid neural network incorporated with in-memory computing for object recognition
ChatAssistDesign: A language-interactive framework for iterative vector floorplan generation via conditional diffusion
Vision-language models for person re-identification: a survey and outlook
Data augmentation with attentional feature aggregation for node classification in GNNs
A comprehensive survey and taxonomy of Mamba: Applications, Challenges, and Future Directions
FedExIT - missing class-agnostic semi-supervised federated learning with extreme imbalance tackling scheme
Span-aware temporal aggregation network for video moment retrieval
ST-Imputer: Multivariate dependency-aware diffusion network with physics guidance for spatiotemporal imputation
IAENet: An importance-aware ensemble model for 3D point cloud-based anomaly detection
Parasite: planting durable backdoors against continual federated model fusion via discriminative feature pushing-pulling
Uncertainty-aware multi-view evidence fusion for feature selection in brain network analysis
Survey of uncertainty estimation in LLMs - Sources, methods, applications, and challenges
Attention-driven contrastive learning for cross-modal hashing with prototypical separation
EBMADDPG: Shapley-based explainable moving target defense for edge intelligence-enabled SIoT systems via joint Bayesian Markov games and DRL
Multimodal brain network analysis: Research advances and challenges
MFF-MTT: a multi-feature fusion-based deep learning algorithm for maneuvering target tracking
A comprehensive benchmark of spatial encoding methods for tabular data with deep neural networks
VLDBench: Evaluating multimodal disinformation with regulatory alignment
Deep learning-based astronomical multimodal data fusion: A comprehensive review
Dimensional compensation for small-sample and small-size insulator burn mark via RGB-point cloud fusion in power grid inspection
TPIN: Text-based parallel interaction network with modality-common and modality-specific for multimodal sentiment analysis
Multi-source information fusion through Tucker tensor decomposition-based transfer learning for handwriting-based Alzheimer's disease detection
Hierarchical cross-module knowledge transfer based on structural multi-view least squares support vector classification
MoMD Transformer: adaptive multi-modal fault diagnosis via knowledge transfer with vibration-current signals
GCEPANet: A lightweight and efficient remote sensing image cloud removal network model for optical-SAR image fusion
Subgraph-focused biomedical knowledge embedding with bi-semantic encoder for multi-type drug-drug interaction prediction
Generating vision-language navigation instructions incorporated fine-grained alignment annotations
SynJAC: synthetic-data-driven joint-granular adaptation and calibration for domain specific scanned document key information extraction
Multi-modal and multi-condition fault diagnosis of rotating machinery via a heterogeneous graph learning framework
DepressInstruct: Instruction tuning of large speech-language models for depression detection
Rethink: reveal the impact of semantic distribution transfer from the cross-modal hashing perspective
Few-shot harmful meme detection via self-adaption mixture-of-experts
Scoping review of multimodal sentiment analysis and summarization: State of the art, challenges and future directions
Bridging the sim-to-real gap in RF localization with large-scale synthetic pretraining
Internet meme on social media: A comprehensive review and new perspectives
A hierarchical information policy fusion framework with multimodal large language models for autonomous guidewire navigation in endovascular procedures
Style-augmented large-scale vision model with domain-generalized knowledge fusion for anomaly detection in powder bed additive manufacturing
Progressive temporal compensation and semantic enhancement for Exo-to-Ego video generation
Regional defeats global: An efficient regional feature fusion via convolutional architecture for multispectral object detection
SG-DGLF: A similarity-guided dual-graph learning framework
HFPN: Hierarchical fusion and prediction network with multi-level cross-modality relation learning for audio-visual event localization
PairHuman: A high-fidelity photographic dataset for customized dual-person generation
Multi-view anchor subspace clustering via consensus-specific reconstruction
GatedFusion-Net: Per-pixel modality weighting in a five-cue transformer for RGB-D-I-T-UV fusion
Interval-valued matrix factorization and knowledge-guided clustering for trust-aware cross-domain recommendation systems
PIFGSR: Pluggable framework for information fusion using generative artificial intelligence (GenAI) in recommender systems
Adaptive probabilistic information fusion under concept drift: a generalized Bayesian framework