Integrating visual and audio cues for emotion and gender recognition: A multi modal and multi task approach
Shrinkage matters: evidence from accuracy-diversity trade-off in regression ensembles
FuseMeter: An efficient framework for generic per-flow traffic measurement
ViP-HMNN: a visual pathway-inspired hybrid neural network incorporated with in-memory computing for object recognition
ChatAssistDesign: A language-interactive framework for iterative vector floorplan generation via conditional diffusion
Vision-language models for person re-identification: a survey and outlook
Data augmentation with attentional feature aggregation for node classification in GNNs
A comprehensive survey and taxonomy of mamba: Applications, Challenges, and Future Directions
FedExIT - missing class-agnostic semi-supervised federated learning with extreme imbalance tackling scheme
Span-aware temporal aggregation network for video moment retrieval
ST-Imputer: Multivariate dependency-aware diffusion network with physics guidance for spatiotemporal imputation
PairHuman: A high-fidelity photographic dataset for customized dual-person generation
Multi-view anchor subspace clustering via consensus-specific reconstruction
GatedFusion-Net: Per-pixel modality weighting in a five-cue transformer for RGB-D-I-T-UV fusion
Interval-valued matrix factorization and knowledge-guided clustering for trust-aware cross-domain recommendation systems
PIFGSR: Pluggable framework for information fusion using generative artificial intelligence (GenAI) in recommender systems
Adaptive probabilistic information fusion under concept drift: a generalized bayesian framework
Enhancing structural condition assessment in steel pipelines via a WGAN-AAE data fusion methodology
Robust multimodal sentiment analysis via double information bottleneck
Anchor graph-guided dual-target alignment network for incomplete multi-view clustering
Aggregate twice more efficiently: Dual feature aggregation transformer for medical image segmentation
Context-aware and multi-view enhanced model for entity alignment
Utilizing hierarchical efficacy regions of the human brain for treatment prediction through information fusion
MPCL: Multimodal prompt learning for continual relation extraction with type-aware inter-modality alignment
Bridging RGB-T image fusion and semantic segmentation via multi-task collaborative learning
Consistency analysis of complex fuzzy preferences for two-stage group decision-making with risk perception of prospect theory
Matrix mixer analysis for time series classification: Attention on tokenization
GLUE3D: General language understanding evaluation for 3D point clouds
The duality of generative AI and reinforcement learning in robotics: A review
Fast fourier transform gated activation function (FFTGate)
Rethinking domain-agnostic continual learning via frequency completeness learning
Heterogeneous environment-aware multimodal recommendation with modality alignment
MSSDF: Modality-shared self-supervised distillation for high-resolution multi-modal remote sensing image learning
Federated learning based water streamflow forecasting via multi-sensor data fusion
One model connects all graphs: Towards training one unified model for multi-domain graph pre-training using adaptive vector quantization
DivineTree: All-in-one 3D tree modeling with diverse and fused visual guidance
Exploring structured uncertainty for external prior-guided hyperspectral image fusion
Self-attention and cross-modal attention for audio-visual zero-shot learning
LGINet: Linguistic guided image diffusion model for tree species generation and identification from aerial imagery
ConfShield: A dual-stage fusion framework for robust and privacy-enhancing federated learning in cloud environments
ComCon: Complementary-contradictory regularization for multimodal knowledge graph completion
PCF-LLM: scaling LLMs for multimodal understanding of structured scientific data in photonic crystal fiber sensors
PreciseVideo: a dual-process framework for zero-shot text-to-video generation with quantitative content control
GDCF-Net: A generative-discriminative contrastive fusion network for multi-class fault diagnosis of power transformer
GEPFNet: A group equivariant feature extraction with parallel fusion neural network for solar photovoltaic fault classification
Multimodal language models in agriculture: A tutorial and survey
AMGNet: Adaptive multi-granularity decoupling network for multimodal sarcasm detection
Multi-modal collaborative learning with vision foundation model prompt boosts 3D semi-supervised semantic segmentation
PMFM-kdTransformer: An enhanced multi-modal fusion architecture leveraging knowledge distillation for intra-hour solar irradiance prediction
MSIF-Convformer: a novel end-to-end fault diagnosis framework with multi-source sensors under strong noise