[2025-03-08] By default not using DPVO. We implemented a SimpleVO, which is more efficient and compatible with GVHMR. [2025-03-08] We added a new option f_mm to specify the focal length of the ...
Previous research has investigated the application of Multimodal Large Language Models (MLLMs) in understanding 3D scenes by interpreting them as videos. These approaches generally depend on ...