Variable Imaging Projection Cloud Scattering Tomography
MPS-NeRF: Generalizable 3D Human Rendering From Multiview Images
Towards Mixed-State Coded Diffraction Imaging
PS$^{2}$F: Polarized Spiral Point Spread Function for Single-Shot 3D Sensing
Physics to the Rescue: Deep Non-Line-of-Sight Reconstruction for High-Speed Imaging
Wide-Baseline Light Fields Using Ellipsoidal Mirrors
BackMix: Regularizing Open Set Recognition by Removing Underlying Fore-Background Priors
Revisiting One-Stage Deep Uncalibrated Photometric Stereo via Fourier Embedding
Bootstrap Masked Visual Modeling via Hard Patch Mining
Spatiotemporal Observer Design for Predictive Learning of High-Dimensional Data
Rethinking Efficient and Effective Point-Based Networks for Event Camera Classification and Regression
Human-Centric Fine-Grained Action Quality Assessment
Continual Unsupervised Generative Modeling
Uncertainty-Calibrated Test-Time Model Adaptation Without Forgetting
Modeling the Label Distributions for Weakly-Supervised Semantic Segmentation
DeferredGS: Decoupled and Relightable Gaussian Splatting With Deferred Shading
Revisiting Supervised Learning-Based Photometric Stereo Networks
StructVPR++: Distill Structural and Semantic Knowledge With Weighting Samples for Visual Place Recognition
Bi-Modality Individual-Aware Prompt Tuning for Visual-Language Model
Constraint Boundary Wandering Framework: Enhancing Constrained Optimization With Deep Neural Networks
Unknown-Aware Bilateral Dependency Optimization for Defending Against Model Inversion Attacks
Interpreting Low-Level Vision Models With Causal Effect Maps
Hyperrectangle Embedding for Debiased 3D Scene Graph Prediction From RGB Sequences
HyperSIGMA: Hyperspectral Intelligence Comprehension Foundation Model
Unified Static and Dynamic Network: Efficient Temporal Filtering for Video Grounding
Stimulative Training++: Go Beyond the Performance Limits of Residual Networks
AnyDoor: Zero-Shot Image Customization With Region-to-Region Reference
Generalized Semantic Contrastive Learning via Embedding Side Information for Few-Shot Object Detection
PointNorm-Net: Self-Supervised Normal Prediction of 3D Point Clouds via Multi-Modal Distribution Estimation
Hadamard Product in Deep Learning: Introduction, Advances and Challenges
PonderV2: Improved 3D Representation With a Universal Pre-Training Paradigm
Toward Collaborative Autonomous Driving: Simulation Platform and End-to-End System
Diagnostic Captioning by Cooperative Task Interactions and Sample-Graph Consistency
End-to-End Open-Vocabulary Video Visual Relationship Detection Using Multi-Modal Prompting
Exploring the Essence of Relationships for Scene Graph Generation via Causal Features Enhancement Network
DDM: A Metric for Comparing 3D Shapes Using Directional Distance Fields
On the Value of Myopic Behavior in Policy Reuse
Accelerate Presolve in Large-Scale Linear Programming via Reinforcement Learning
Interactive Conversational Head Generation
HiDe-PET: Continual Learning via Hierarchical Decomposition of Parameter-Efficient Tuning
HOT: An Efficient Halpern Accelerating Algorithm for Optimal Transport Problems
Normalized-Full-Palmar-Hand: Toward More Accurate Hand-Based Multimodal Biometrics
Unified Domain Adaptive Semantic Segmentation
DeepInteraction++: Multi-Modality Interaction for Autonomous Driving
Degradation-Aware Residual-Conditioned Optimal Transport for Unified Image Restoration
Rethinking Evaluation Metrics of Open-Vocabulary Segmentation
Prophet: Prompting Large Language Models With Complementary Answer Heuristics for Knowledge-Based Visual Question Answering
Monge-Ampere Regularization for Learning Arbitrary Shapes From Point Clouds
Diff-Retinex++: Retinex-Driven Reinforced Diffusion Model for Low-Light Image Enhancement
Sparse-DeRF: Deblurred Neural Radiance Fields From Sparse View