SpatialMoR-VGGT: Spatially Adaptive Efficient 3D Scene Reconstruction
  • Author(s): Saksham Gupta ; Dr. Ramanjot Kaur
  • Paper ID: 1710307
  • Page: 830-839
  • Published Date: 26-08-2025
  • Published In: Iconic Research And Engineering Journals
  • Publisher: IRE Journals
  • e-ISSN: 2456-8880
  • Volume/Issue: Volume 9 Issue 2 August-2025
Abstract

We present SpatialMoR-VGGT, a novel framework that extends the Mixture-of-Recursions (MoR) paradigm to spatial reasoning in 3D vision tasks. While VGGT has demonstrated remarkable capabilities as a feed-forward transformer that directly infers all key 3D attributes of a scene—including camera parameters, point maps, depth maps, and point tracks—it processes all spatial regions with uniform computational depth. Our framework dynamically adjusts the recursion depth for different spatial regions of the scene, allocating more computational resources to complex areas while maintaining efficiency in simpler regions. This adaptation requires addressing fundamental differences between sequential token processing in language and spatially coherent processing in vision. We introduce spatially-aware routing mechanisms and KV caching strategies specifically designed for visual data, along with a balanced training objective that preserves spatial coherence while enabling adaptive computation. Through rigorous experimentation on standard 3D reconstruction benchmarks, we demonstrate that SpatialMoR-VGGT achieves comparable reconstruction quality to standard VGGT with 18-22% reduced computational requirements. This work establishes a foundation for adaptive computation in 3D vision tasks, with potential applications across AR/VR, robotics, and real-time 3D content creation.

Keywords

3D Reconstruction, Adaptive Computation, Recursive Transformers, Visual Geometry

Citations

IRE Journals:
Saksham Gupta , Dr. Ramanjot Kaur "SpatialMoR-VGGT: Spatially Adaptive Efficient 3D Scene Reconstruction" Iconic Research And Engineering Journals Volume 9 Issue 2 2025 Page 830-839

IEEE:
Saksham Gupta , Dr. Ramanjot Kaur "SpatialMoR-VGGT: Spatially Adaptive Efficient 3D Scene Reconstruction" Iconic Research And Engineering Journals, 9(2)