PairHuman: A high-fidelity photographic dataset for customized dual-person generation
Multi-view anchor subspace clustering via consensus-specific reconstruction
GatedFusion-Net: Per-pixel modality weighting in a five-cue transformer for RGB-D-I-T-UV fusion
Interval-valued matrix factorization and knowledge-guided clustering for trust-aware cross-domain recommendation systems
PIFGSR: Pluggable framework for information fusion using generative artificial intelligence (GenAI) in recommender systems
Adaptive probabilistic information fusion under concept drift: a generalized Bayesian framework
Enhancing structural condition assessment in steel pipelines via a WGAN-AAE data fusion methodology
Robust multimodal sentiment analysis via double information bottleneck
Anchor graph-guided dual-target alignment network for incomplete multi-view clustering
Aggregate twice more efficiently: Dual feature aggregation transformer for medical image segmentation
Context-aware and multi-view enhanced model for entity alignment
Utilizing hierarchical efficacy regions of the human brain for treatment prediction through information fusion
MPCL: Multimodal prompt learning for continual relation extraction with type-aware inter-modality alignment
Bridging RGB-T image fusion and semantic segmentation via multi-task collaborative learning
Consistency analysis of complex fuzzy preferences for two-stage group decision-making with risk perception of prospect theory
Matrix mixer analysis for time series classification: Attention on tokenization
GLUE3D: General language understanding evaluation for 3D point clouds
The duality of generative AI and reinforcement learning in robotics: A review
Fast Fourier transform gated activation function (FFTGate)
Rethinking domain-agnostic continual learning via frequency completeness learning
Heterogeneous environment-aware multimodal recommendation with modality alignment
MSSDF: Modality-shared self-supervised distillation for high-resolution multi-modal remote sensing image learning
Federated learning based water streamflow forecasting via multi-sensor data fusion
One model connects all graphs: Towards training one unified model for multi-domain graph pre-training using adaptive vector quantization
DivineTree: All-in-one 3D tree modeling with diverse and fused visual guidance
Exploring structured uncertainty for external prior-guided hyperspectral image fusion
Self-attention and cross-modal attention for audio-visual zero-shot learning
LGINet: Linguistic guided image diffusion model for tree species generation and identification from aerial imagery
ConfShield: A dual-stage fusion framework for robust and privacy-enhancing federated learning in cloud environments
ComCon: Complementary-contradictory regularization for multimodal knowledge graph completion
PCF-LLM: scaling LLMs for multimodal understanding of structured scientific data in photonic crystal fiber sensors
PreciseVideo: a dual-process framework for zero-shot text-to-video generation with quantitative content control
GDCF-Net: A generative-discriminative contrastive fusion network for multi-class fault diagnosis of power transformer
GEPFNet: A group equivariant feature extraction with parallel fusion neural network for solar photovoltaic fault classification
Multimodal language models in agriculture: A tutorial and survey
AMGNet: Adaptive multi-granularity decoupling network for multimodal sarcasm detection
Multi-modal collaborative learning with vision foundation model prompt boosts 3D semi-supervised semantic segmentation
PMFM-kdTransformer: An enhanced multi-modal fusion architecture leveraging knowledge distillation for intra-hour solar irradiance prediction
MSIF-Convformer: a novel end-to-end fault diagnosis framework with multi-source sensors under strong noise
Zonotopic set-membership state estimation for linear repetitive processes with multirate measurements under encoding-decoding mechanisms
Robust fusion filtering for 2-D multi-sensor systems with measurement censoring and redundant channels
A real-time surface defect detection model based on adaptive feature information selection and fusion
MMME: A spontaneous multi-modal micro-expression dataset enabling visual-physiological fusion
SaSAM: Scale-aware segmentation anything model for multimodal remote sensing images
A review of fake news detection based on transfer learning
D-RGCN: Software defect prediction based on dual directed dependency graph reconstruction
Mitigating class imbalance in forest fire prediction with GAN-Augmented data fusion
Diverse semantic representation learning based on vision-language models for zero-shot indoor scene recognition
Bridging the gap between computer vision and bioelectrical signal analysis
Refinement-Guided Critique Learning: A Framework for Training Critique Models