MulMoSenT: Multimodal sentiment analysis for a low-resource language using textual-visual cross-attention and fusion
Speech emotion recognition: A systematic mega-review of techniques and pipelines
ExInCOACH: Strategic exploration meets interactive tutoring for context-aware game onboarding
Fusion of quantum computing with smart agriculture: A systematic review of methods, implementation, applications, and challenges
Dual-layer prompt ensembles: Leveraging system- and user-level instructions for robust LLM-based query expansion and rank fusion
Lifting wavelet transform-guided network with histogram attention for liver segmentation in CT scans
A novel knowledge distillation method for graph neural networks with gradient mapping and fusion
GeoCraft: A Diffusion Model-based 3D Reconstruction Method driven by image and point cloud fusion
GC-Fed: Gradient centralized federated learning with partial client participation
Validity-aware context modeling for gradient-guided image inpainting
Tokenized EEG signals with large language models for epilepsy detection via multimodal information fusion
A survey of multimodal fusion for Alzheimer’s disease prediction: A new taxonomy and trends
Multimodal spatio-temporal fusion: A generalizable GCN-LSTM with attention framework for urban application
GIAFormer: A Gradient-Infused Attention and Transformer for Pain Assessment with EDA-fNIRS Fusion
Code-driven programming prediction enhanced by LLM with a feature fusion approach
Data fusion for low-cost sensors: A systematic literature review
Adversarial perturbation for RGB-T tracking via intra-modal excavation and cross-modal collusion
IDFL: Incentive-driven federated learning with selfish clients
StegaFusion: Steganography for information hiding and fusion in multimodality
An adaptive regularized topological segmentation network integrating inter-class relations and occlusion information for vehicle component recognition
MSTFDN: An EEG-fNIRS multimodal spatial-temporal fusion decoding network for personalized multi-task scenarios
Unsupervised multimodal graph completion networks with multi-level contrastiveness for modality-missing conversation understanding
Crowdsourced federated learning with inconsistent label representation
Grading-inspired complementary enhancing for multimodal sentiment analysis
Vision-language model with siamese bilateral difference network and text-guided image feature enhancement for acute ischemic stroke outcome prediction on CT angiography
MCIVA: A multi-view pedestrian detection framework with a central inverse nearest neighbor map and a view adaptive module
Adaptive virtual anchors for efficient and stable clustering over large multi-view attributed graphs
Information-theoretic graph fusion with vision-language-action model for policy reasoning and dual robotic control
All-weather multi-modality image fusion: Unified framework and 100k benchmark
Large multimodal models for low-resource languages: A survey
Unleashing Mamba’s expressive power: A non-tradeoff approach to spatio-temporal forecasting
SU-RMT: Toward bridging semantic representation and structural detail modeling for medical image segmentation
Multiple channel access and power control for discount-average weighting criterion over multi-sensor and Markovian fading environments
A two-stage learning network for PVINS modeling and fusion estimation in challenging environments
Shape-aware osteoarthritis network: Bidirectional fusion of MRI and 3D point clouds for knee osteoarthritis diagnosis
PGSC: A gradient sparsification communication optimization criterion for nonequilibrium thermodynamics
Integrating visual and audio cues for emotion and gender recognition: A multi modal and multi task approach
Shrinkage matters: evidence from accuracy-diversity trade-off in regression ensembles
FuseMeter: An efficient framework for generic per-flow traffic measurement
ViP-HMNN: a visual pathway-inspired hybrid neural network incorporated with in-memory computing for object recognition
ChatAssistDesign: A language-interactive framework for iterative vector floorplan generation via conditional diffusion
Vision-language models for person re-identification: a survey and outlook
Data augmentation with attentional feature aggregation for node classification in GNNs
A comprehensive survey and taxonomy of mamba: Applications, Challenges, and Future Directions
FedExIT - missing class-agnostic semi-supervised federated learning with extreme imbalance tackling scheme
Span-aware temporal aggregation network for video moment retrieval
ST-Imputer: Multivariate dependency-aware diffusion network with physics guidance for spatiotemporal imputation
IAENet: An importance-aware ensemble model for 3D point cloud-based anomaly detection
Parasite: planting durable backdoors against continual federated model fusion via discriminative feature pushing-pulling
Uncertainty-aware multi-view evidence fusion for feature selection in brain network analysis