Geometry Fidelity for Spherical Images

التفاصيل البيبلوغرافية
العنوان: Geometry Fidelity for Spherical Images
المؤلفون: Christensen, Anders, Mojab, Nooshin, Patel, Khushman, Ahuja, Karan, Akata, Zeynep, Winther, Ole, Gonzalez-Franco, Mar, Colaco, Andrea
سنة النشر: 2024
المجموعة: Computer Science
مصطلحات موضوعية: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning
الوصف: Spherical or omni-directional images offer an immersive visual format appealing to a wide range of computer vision applications. However, geometric properties of spherical images pose a major challenge for models and metrics designed for ordinary 2D images. Here, we show that direct application of Fr\'echet Inception Distance (FID) is insufficient for quantifying geometric fidelity in spherical images. We introduce two quantitative metrics accounting for geometric constraints, namely Omnidirectional FID (OmniFID) and Discontinuity Score (DS). OmniFID is an extension of FID tailored to additionally capture field-of-view requirements of the spherical format by leveraging cubemap projections. DS is a kernel-based seam alignment score of continuity across borders of 2D representations of spherical images. In experiments, OmniFID and DS quantify geometry fidelity issues that are undetected by FID.
Comment: Accepted at ECCV 2024
نوع الوثيقة: Working Paper
URL الوصول: http://arxiv.org/abs/2407.18207
رقم الأكسشن: edsarx.2407.18207
قاعدة البيانات: arXiv