Monocular Gaussian SLAM with Language Extended Loop Closure

التفاصيل البيبلوغرافية
العنوان: Monocular Gaussian SLAM with Language Extended Loop Closure
المؤلفون: Lan, Tian, Lin, Qinwei, Wang, Haoqian
سنة النشر: 2024
المجموعة: Computer Science
مصطلحات موضوعية: Computer Science - Computer Vision and Pattern Recognition
الوصف: Recently,3DGaussianSplattinghasshowngreatpotentialin visual Simultaneous Localization And Mapping (SLAM). Existing methods have achieved encouraging results on RGB-D SLAM, but studies of the monocular case are still scarce. Moreover, they also fail to correct drift errors due to the lack of loop closure and global optimization. In this paper, we present MG-SLAM, a monocular Gaussian SLAM with a language-extended loop closure module capable of performing drift-corrected tracking and high-fidelity reconstruction while achieving a high-level understanding of the environment. Our key idea is to represent the global map as 3D Gaussian and use it to guide the estimation of the scene geometry, thus mitigating the efforts of missing depth information. Further, an additional language-extended loop closure module which is based on CLIP feature is designed to continually perform global optimization to correct drift errors accumulated as the system runs. Our system shows promising results on multiple challenging datasets in both tracking and mapping and even surpasses some existing RGB-D methods.
نوع الوثيقة: Working Paper
URL الوصول: http://arxiv.org/abs/2405.13748
رقم الأكسشن: edsarx.2405.13748
قاعدة البيانات: arXiv