Global Model Selection via Solution Paths for Robust Support Vector Machine
Self-Supervised Learning for Real-World Super-Resolution From Dual and Multiple Zoomed Observations
A Versatile Framework for Multi-Scene Person Re-Identification
Fast Window-Based Event Denoising With Spatiotemporal Correlation Enhancement
Imaginary-Connected Embedding in Complex Space for Unseen Attribute-Object Discrimination
Multi-Scale Part-Based Feature Representation for 3D Domain Generalization and Adaptation
Hi-SAM: Marrying Segment Anything Model for Hierarchical Text Segmentation
Task-Oriented Channel Attention for Fine-Grained Few-Shot Classification
A Comprehensive Survey of Forgetting in Deep Learning Beyond Continual Learning
PATNAS: A Path-Based Training-Free Neural Architecture Search
Correlation Verification for Image Retrieval and Its Memory Footprint Optimization
BokehMe++: Harmonious Fusion of Classical and Neural Rendering for Versatile Bokeh Creation
Partial Scene Text Retrieval
Streaming Quanta Sensors for Online, High-Performance Imaging and Vision
DiffI2I: Efficient Diffusion Model for Image-to-Image Translation
Fine-Grained Visual Text Prompting
Practical Compact Deep Compressed Sensing
IBCS: Learning Information Bottleneck-Constrained Denoised Causal Subgraph for Graph Classification
DiffAct++: Diffusion Action Segmentation
Anchor3DLane++: 3D Lane Detection via Sample-Adaptive Sparse 3D Anchor Regression
Robust and Transferable Backdoor Attacks Against Deep Image Compression With Selective Frequency Prior
WinDB: HMD-Free and Distortion-Free Panoptic Video Fixation Learning
MIGC++: Advanced Multi-Instance Generation Controller for Image Synthesis
Fair Clustering Ensemble With Equal Cluster Capacity
Adaptive Graph Learning With Semantic Promotability for Domain Adaptation
MetaEarth: A Generative Foundation Model for Global-Scale Remote Sensing Image Generation
Uni-AdaFocus: Spatial-Temporal Dynamic Computation for Video Recognition
NAS-PED: Neural Architecture Search for Pedestrian Detection
Generalized Face Liveness Detection via De-Fake Face Generator
STAR: A First-Ever Dataset and a Large-Scale Benchmark for Scene Graph Generation in Large-Size Satellite Imagery
ELDP: Enhanced Label Distribution Propagation for Crowdsourcing
Unsupervised Global and Local Homography Estimation With Coplanarity-Aware GAN
LVLM-EHub: A Comprehensive Evaluation Benchmark for Large Vision-Language Models
JARVIS-1: Open-World Multi-Task Agents With Memory-Augmented Multimodal Language Models
Scale Propagation Network for Generalizable Depth Completion
Iteratively Capped Reweighting Norm Minimization With Global Convergence Guarantee for Low-Rank Matrix Learning
Remembering What is Important: A Factorised Multi-Head Retrieval and Auxiliary Memory Stabilisation Scheme for Human Motion Prediction
Divide-and-Conquer: Confluent Triple-Flow Network for RGB-T Salient Object Detection
Spiking Variational Policy Gradient for Brain Inspired Reinforcement Learning
Spectrally-Corrected and Regularized LDA for Spiked Model
A Survey and Benchmark of Automatic Surface Reconstruction From Point Clouds
BEVFormer: Learning Bird’s-Eye-View Representation From LiDAR-Camera via Spatiotemporal Transformers
Saliency-Free and Aesthetic-Aware Panoramic Video Navigation
Fast Semi-Supervised Learning on Large Graphs: An Improved Green-Function Method
Fully-Connected Transformer for Multi-Source Image Fusion
Natural Adversarial Mask for Face Identity Protection in Physical World
PDPP: Projected Diffusion for Procedure Planning in Instructional Videos
Hierarchical Banzhaf Interaction for General Video-Language Representation Learning
Continual Learning: Forget-Free Winning Subnetworks for Video Representations
Trajectory of Fifths Based on Chroma Subbands Extraction–A New Approach to Music Representation, Analysis, and Classification