Adversarial and generative AI-based anti-forensics in audio-visual deepfake detection: A comprehensive review and analysis FedFusionNet: Advancing oral cancer recurrence prediction through federated fusion modeling MuDeNet: A multi-patch descriptor network for anomaly modeling CMVF: Cross-modal unregistered video fusion via spatio-temporal consistency Lightweight music recommendation via multi-physiological feature fusion CATCH: Causal attention enhanced meta-path semantic fusion for robust hyperbolic heterogeneous graph embedding Multi-lingual approach for multi-modal emotion and sentiment recognition based on triple fusion A unified optimization framework for backdoor attacks in large language models Explainable visual question answering: A survey on methods, datasets and evaluation PIA: Fusing edge prior information into attention for semantic segmentation in vision transformer Bridging cognition and emotion: Empathy-driven multimodal misinformation detection Sharable and discriminative multi-view geometry-adaptive fusion network for 3D dental model segmentation The effect of data poisoning on counterfactual explanations Multimodal action recognition for manufacturing assembly task through spatio-temporal knowledge fusion An interpretable deep unfolding framework for multi-view representation learning Multimodal dynamic fusion framework for survival prediction in clear cell renal cell carcinoma Knowledge graph-augmented stacking for accurate bike-sharing demand forecasting: The ridegraph framework FDA-CAPMA: Federated domain adaptation with co-activation pattern and multimodal mamba for fMRI depression detection FusionBev: LiDAR and 4D radar fusion for 3D object detection From privacy to trust in the agentic era: a taxonomy of challenges in trustworthy federated learning through the lens of trust report 2.0 A Bayesian approach to offline autonomous aerial search path generation URL2Graph++: Unified semantic-structural-character learning for malicious URL detection MMFN : A novel multi-view multimodal fusion network for pediatric intestinal obstruction recognition MulMoSenT: Multimodal sentiment analysis for a low-resource language using textual-visual cross-attention and fusion Speech emotion recognition: A systematic mega-review of techniques and pipelines ExInCOACH: Strategic exploration meets interactive tutoring for context-aware game onboarding Fusion of quantum computing with smart agriculture: A systematic review of methods, implementation, applications, and challenges Dual-layer prompt ensembles: Leveraging system- and user-level instructions for robust LLM-based query expansion and rank fusion Lifting wavelet transform-guided network with histogram attention for liver segmentation in CT scans A novel knowledge distillation method for graph neural networks with gradient mapping and fusion GeoCraft: A Diffusion Model-based 3D Reconstruction Method driven by image and point cloud fusion GC-Fed: Gradient centralized federated learning with partial client participation Validity-aware context modeling for gradient-guided image inpainting Tokenized EEG signals with large language models for epilepsy detection via multimodal information fusion A survey of multimodal fusion for Alzheimer’s disease prediction: A new taxonomy and trends Multimodal spatio-temporal fusion: A generalizable GCN-LSTM with attention framework for urban application GIAFormer: A Gradient-Infused Attention and Transformer for Pain Assessment with EDA-fNIRS Fusion Code-driven programming prediction enhanced by LLM with a feature fusion approach Data fusion for low-cost sensors: A systematic literature review Adversarial perturbation for RGB-T tracking via intra-modal excavation and cross-modal collusion IDFL: Incentive-driven federated learning with selfish clients StegaFusion: Steganography for information hiding and fusion in multimodality An adaptive regularized topological segmentation network integrating inter-class relations and occlusion information for vehicle component recognition MSTFDN: An EEG-fNIRS multimodal spatial-temporal fusion decoding network for personalized multi-task scenarios Unsupervised multimodal graph completion networks with multi-level contrastiveness for modality-missing conversation understanding Crowdsourced federated learning with inconsistent label representation Grading-inspired complementary enhancing for multimodal sentiment analysis Vision-language model with siamese bilateral difference network and text-guided image feature enhancement for acute ischemic stroke outcome prediction on CT angiography MCIVA: A multi-view pedestrian detection framework with a central inverse nearest neighbor map and a view adaptive module Adaptive virtual anchors for efficient and stable clustering over large multi-view attributed graphs