YOLO-MS: Rethinking Multi-Scale Representation Learning for Real-Time Object Detection
Instruct-ReID++: Towards Universal Purpose Instruction-Guided Person Re-Identification
CLRNetV2: A Faster and Stronger Lane Detector
LoCo: Low-Bit Communication Adaptor for Large-Scale Model Training
$S^{2}$S2-Transformer for Mask-Aware Hyperspectral Image Reconstruction
Optimization of Rank Losses for Image Retrieval
Improving Adversarial Training From the Perspective of Class-Flipping Distribution
Equivariant Diffusion Model With A5-Group Neurons for Joint Pose Estimation and Shape Reconstruction
S-NeRF++: Autonomous Driving Simulation via Neural Reconstruction and Generation
Shared Growth of Graph Neural Networks via Prompted Free-Direction Knowledge Distillation
Revisiting Gradient-Based Uncertainty for Monocular Depth Estimation
Diffusion Model-Based Image Editing: A Survey
Wholly-WOOD: Wholly Leveraging Diversified-Quality Labels for Weakly-Supervised Oriented Object Detection
Pixel-Inconsistency Modeling for Image Manipulation Localization
Beyond Batch Learning: Global Awareness Enhanced Domain Adaptation
Learning Without Forgetting for Vision-Language Models
A Variational Bayesian Inference Theory of Elasticity and Its Mixed Probabilistic Finite Element Method for Inverse Deformation Solutions in Any Dimension
Local Texture Pattern Estimation for Image Detail Super-Resolution
Gait Recognition in the Wild: A Large-Scale Benchmark and NAS-Based Baseline
Learning to Rebalance Multi-Modal Optimization by Adaptively Masking Subnetworks
Correlated Topic Modeling for Short Texts in Spherical Embedding Spaces
AUCPro: AUC-Oriented Provable Robustness Learning
Heterogeneous Correlation Aware Regularization for Sequential Confidence Calibration
Pixel2Pixel: A Pixelwise Approach for Zero-Shot Single Image Denoising
Diffusion Models in Low-Level Vision: A Survey
The NeRF Signature: Codebook-Aided Watermarking for Neural Radiance Fields
Context Perception Parallel Decoder for Scene Text Recognition
A Generalized Tensor Formulation for Hyperspectral Image Super-Resolution Under General Spatial Blurring
Replay Without Saving: Prototype Derivation and Distribution Rebalance for Class-Incremental Semantic Segmentation
Attack as Defense: Proactive Adversarial Multi-Modal Learning to Evade Retrieval
MOVE: Effective and Harmless Ownership Verification via Embedded External Features
Learning Emotion Category Representation to Detect Emotion Relations Across Languages
NER-Net+: Seeing Motion at Nighttime With an Event Camera
Learning Probabilistic Presence-Absence Evidence for Weakly-Supervised Audio-Visual Event Perception
DFedADMM: Dual Constraint Controlled Model Inconsistency for Decentralize Federated Learning
Unified Prompt Attack Against Text-to-Image Generation Models
Bayesian Variance Change Point Detection With Credible Sets
Towards a Theoretical Understanding of Semi-Supervised Learning Under Class Distribution Mismatch
Hessian-Aware Zeroth-Order Optimization
CLIP-Driven Transformer for Weakly Supervised Object Localization
Gauging-$\delta$δ: A Non-Parametric Hierarchical Clustering Algorithm
PhysMLE: Generalizable and Priors-Inclusive Multi-Task Remote Physiological Measurement
Towards Expressive Spectral-Temporal Graph Neural Networks for Time Series Forecasting
Reliable Representation Learning for Incomplete Multi-View Missing Multi-Label Classification
A Lightweight Deep Exclusion Unfolding Network for Single Image Reflection Removal
Systematic Bias of Machine Learning Regression Models and Correction
Class-Agnostic Repetitive Action Counting Using Wearable Devices
Spatial Residual for Underwater Object Detection
On the Upper Bounds of Number of Linear Regions and Generalization Error of Deep Convolutional Neural Networks
Graph Foundation Models: Concepts, Opportunities and Challenges