Geometry Forcing: Marrying Video Diffusion and 3D Representation for Consistent World Modeling Paper • 2507.07982 • Published Jul 10, 2025 • 34
MeshLLM: Empowering Large Language Models to Progressively Understand and Generate 3D Mesh Paper • 2508.01242 • Published Aug 2, 2025 • 11
Loc3R-VLM: Language-based Localization and 3D Reasoning with Vision-Language Models Paper • 2603.18002 • Published Mar 18, 2026 • 13
Generation Models Know Space: Unleashing Implicit 3D Priors for Scene Understanding Paper • 2603.19235 • Published Mar 19, 2026 • 95
SpatialBoost: Enhancing Visual Representation through Language-Guided Reasoning Paper • 2603.22057 • Published Mar 23, 2026 • 46
SIMART: Decomposing Monolithic Meshes into Sim-ready Articulated Assets via MLLM Paper • 2603.23386 • Published Mar 24, 2026 • 40
HERMES++: Toward a Unified Driving World Model for 3D Scene Understanding and Generation Paper • 2604.28196 • Published 28 days ago • 72
RigidFormer: Learning Rigid Dynamics using Transformers Paper • 2605.09196 • Published 19 days ago • 14
GenRecon: Bridging Generative Priors for Multi-View 3D Scene Reconstruction Paper • 2605.23888 • Published 6 days ago • 11
SpatialBench: Is Your Spatial Foundation Model an All-Round Player? Paper • 2605.27367 • Published 2 days ago • 62