5

Emergent Outlier View Rejection in Visual Geometry Grounded Transformers
Thinking in 360°: Humanoid Visual Search in the Wild
Wanderland: Geometrically Grounded Simulation for Open-World Embodied AI
Adversarial Exploitation of Data Diversity Improves Visual Localization
Extrapolated Urban View Synthesis Benchmark
GARF: Learning Generalizable 3D Reassembly for Real-World Fractures
Co-VisiON: Co-Visibility ReasONing on Sparse Image Sets of Indoor Scenes
VLM See, Robot Do: Human Demo Video to Robot Action Plan via Vision Language Model
CityWalker: Learning Embodied Urban Navigation from Web-Scale Videos
Multiview Scene Graph