Toward Better Generalization Bounds of Stochastic Optimization for Nonconvex Learning
A Comprehensive Survey on Evidential Deep Learning and its Applications
Examining the Impact of Optical Aberrations to Image Classification and Object Detection Models
Estimating Fog Parameters From a Sequence of Stereo Images
High-Resolution Photo Enhancement in Real-Time: A Laplacian Pyramid Network
Improving Model Fusion by Training-Time Neuron Alignment With Fixed Neuron Anchors
Uncertainty-Aware Disentangled Dynamic Graph Attention Network for Out-of-Distribution Generalization
Improving Embedding of Graphs With Missing Data by Soft Manifolds
Graph Quality Matters on Revealing the Semantics Behind the Data in Physical World
Causality-Driven Convolutional Manifold Attention Network for Electroencephalogram Signal Decoding
NVS-SQA: Exploring Self-Supervised Quality Representation Learning for Neurally Synthesized Scenes Without References
PAVM: Progressive and Adaptive Variance Minimization Algorithm for Robust Registration
Refine, Control and Distill: A Text-to-Image Framework for Faithful Image Generation
Topo4D++: Realistic Physically Based 4D Head Capture With Topology-Preserving Gaussian Splatting and Expression Priors
Efficient and Effective Weight-Ensembling Mixture of Experts for Multi-Task Model Merging
Gaussian Mixture Conditional Variational Recurrent Neural Network for Unified Trajectory Imputation and Prediction
UniDepthV2: Universal Monocular Metric Depth Estimation Made Simpler
Data Augmentation With Regularization for Multi-Labeled Complementary Label Learning
A Gravity-Informed Spatiotemporal Transformer for Human Activity Intensity Prediction
Good Performance Estimation Strategies are All You Need in Neural Architecture Search
DynamicPAE: Generating Scene-Aware Physical Adversarial Examples in Real-Time
Bayesian Window Transformer for Image Restoration
Neural Eigenfunctions are Structured Representation Learners
Joint Short-Term Origin-Destination Demand Prediction for Multimodal Transport Systems
A Review of Uncertainty Representation and Quantification in Neural Networks
Energy-Based Model for Accurate Estimation of Shapley Values in Feature Attribution
Robust Disentangled Counterfactual Learning for Physical Audiovisual Commonsense Reasoning
SED++: A Simple Encoder-Decoder for Improved Open-Vocabulary Semantic Segmentation
Demystifying Higher-Order Graph Neural Networks
OoDBench+: Quantifying and Understanding Two Dimensions of Out-of-Distribution Generalization
Structural Similarity in Deep Features: Unified Image Quality Assessment Robust to Geometrically Disparate Reference
SpeechPalette: A Comprehensive Speech Editing Method for Text-Based Speech Editing, One-Shot TTS and Attributes Editing
Sparse Trajectory Prediction
MECD+: Unlocking Event-Level Causal Graph Discovery for Video Reasoning
Revisiting Transformation Invariant Geometric Deep Learning: An Initial Representation Perspective
Joint Sparse Optical Flow Estimation and Keypoint Detection via Dual-Task Imperative Learning
HAT: Hybrid Attention Transformer for Image Restoration
AdaGen: Learning Adaptive Policy for Image Synthesis
Evolving Graph Learning for Out-of-Distribution Generalization in Non-Stationary Environments
SelaVPR++: Towards Seamless Adaptation of Foundation Models for Efficient Place Recognition
Toward Visual Grounding: A Survey
FlexPara: Flexible Neural Surface Parameterization
Variational Bayesian Semi-Supervised Keyword Extraction
${\text{CA}^{2}\text{ST}}$: Cross-Attention in Audio, Space, and Time for Holistic Video Recognition
UniVST: A Unified Framework for Training-Free Localized Video Style Transfer
DifFlow3D: Hierarchical Diffusion Models for Uncertainty-Aware 3D Scene Flow Estimation
Condition Numbers in Multiview Geometry, Instability in Relative Pose Estimation, and RANSAC
Large-Scale Omnidirectional Person Positioning
SPAN: Learning Similarity Between Scene Graphs and Images With Transformers
A Visual Benchmark for Autonomous Driving in Open-Pit Mines