Adversarial and generative AI-based anti-forensics in audio-visual deepfake detection: A comprehensive review and analysis
FedFusionNet: Advancing oral cancer recurrence prediction through federated fusion modeling
MuDeNet: A multi-patch descriptor network for anomaly modeling
CMVF: Cross-modal unregistered video fusion via spatio-temporal consistency
Lightweight music recommendation via multi-physiological feature fusion
CATCH: Causal attention enhanced meta-path semantic fusion for robust hyperbolic heterogeneous graph embedding
Multi-lingual approach for multi-modal emotion and sentiment recognition based on triple fusion
A unified optimization framework for backdoor attacks in large language models
Explainable visual question answering: A survey on methods, datasets and evaluation
PIA: Fusing edge prior information into attention for semantic segmentation in vision transformer
Bridging cognition and emotion: Empathy-driven multimodal misinformation detection
Sharable and discriminative multi-view geometry-adaptive fusion network for 3D dental model segmentation
The effect of data poisoning on counterfactual explanations
Multimodal action recognition for manufacturing assembly task through spatio-temporal knowledge fusion
An interpretable deep unfolding framework for multi-view representation learning
Multimodal dynamic fusion framework for survival prediction in clear cell renal cell carcinoma
Knowledge graph-augmented stacking for accurate bike-sharing demand forecasting: The ridegraph framework
FDA-CAPMA: Federated domain adaptation with co-activation pattern and multimodal mamba for fMRI depression detection
FusionBev: LiDAR and 4D radar fusion for 3D object detection
From privacy to trust in the agentic era: a taxonomy of challenges in trustworthy federated learning through the lens of trust report 2.0
A Bayesian approach to offline autonomous aerial search path generation
URL2Graph++: Unified semantic-structural-character learning for malicious URL detection
MMFN : A novel multi-view multimodal fusion network for pediatric intestinal obstruction recognition
MulMoSenT: Multimodal sentiment analysis for a low-resource language using textual-visual cross-attention and fusion
Speech emotion recognition: A systematic mega-review of techniques and pipelines
ExInCOACH: Strategic exploration meets interactive tutoring for context-aware game onboarding
Fusion of quantum computing with smart agriculture: A systematic review of methods, implementation, applications, and challenges
Dual-layer prompt ensembles: Leveraging system- and user-level instructions for robust LLM-based query expansion and rank fusion
Lifting wavelet transform-guided network with histogram attention for liver segmentation in CT scans
A novel knowledge distillation method for graph neural networks with gradient mapping and fusion
GeoCraft: A Diffusion Model-based 3D Reconstruction Method driven by image and point cloud fusion
GC-Fed: Gradient centralized federated learning with partial client participation
Validity-aware context modeling for gradient-guided image inpainting
Tokenized EEG signals with large language models for epilepsy detection via multimodal information fusion
A survey of multimodal fusion for Alzheimer’s disease prediction: A new taxonomy and trends
Multimodal spatio-temporal fusion: A generalizable GCN-LSTM with attention framework for urban application
GIAFormer: A Gradient-Infused Attention and Transformer for Pain Assessment with EDA-fNIRS Fusion
Code-driven programming prediction enhanced by LLM with a feature fusion approach
Data fusion for low-cost sensors: A systematic literature review
Adversarial perturbation for RGB-T tracking via intra-modal excavation and cross-modal collusion
IDFL: Incentive-driven federated learning with selfish clients
StegaFusion: Steganography for information hiding and fusion in multimodality
An adaptive regularized topological segmentation network integrating inter-class relations and occlusion information for vehicle component recognition
MSTFDN: An EEG-fNIRS multimodal spatial-temporal fusion decoding network for personalized multi-task scenarios
Unsupervised multimodal graph completion networks with multi-level contrastiveness for modality-missing conversation understanding
Crowdsourced federated learning with inconsistent label representation
Grading-inspired complementary enhancing for multimodal sentiment analysis
Vision-language model with siamese bilateral difference network and text-guided image feature enhancement for acute ischemic stroke outcome prediction on CT angiography
MCIVA: A multi-view pedestrian detection framework with a central inverse nearest neighbor map and a view adaptive module
Adaptive virtual anchors for efficient and stable clustering over large multi-view attributed graphs