2

EgoPAT3Dv2: Predicting 3D Action Target from 2D Egocentric Vision for Human-Robot Interaction
Multiagent Multitraversal Multimodal Self-Driving: Open MARS Dataset
ActFormer: Scalable Collaborative Perception via Active Queries
NYC-Indoor-VPR: A Long-Term Indoor Visual Place Recognition Dataset with Semi-Automatic Annotation
Tell Me Where You Are: Multimodal LLMs Meet Place Recognition
LiDAR-based 4D Occupancy Completion and Forecasting
Among Us: Adversarially Robust Collaborative Perception by Consensus
SSCBench: A Large-Scale 3D Semantic Scene Completion Benchmark for Autonomous Driving
Collaborative Multi-Object Tracking with Conformal Uncertainty Propagation
Metric-Free Exploration for Topological Mapping by Task and Motion Imitation in Feature Space