Learning Multi-modal Information for Robust Light Field Depth Estimation

التفاصيل البيبلوغرافية
العنوان:	Learning Multi-modal Information for Robust Light Field Depth Estimation
المؤلفون:	Piao, Yongri, Ji, Xinxin, Zhang, Miao, Zhang, Yukun
سنة النشر:	2021
المجموعة:	Computer Science
مصطلحات موضوعية:	Computer Science - Computer Vision and Pattern Recognition
الوصف:	Light field data has been demonstrated to facilitate the depth estimation task. Most learning-based methods estimate the depth infor-mation from EPI or sub-aperture images, while less methods pay attention to the focal stack. Existing learning-based depth estimation methods from the focal stack lead to suboptimal performance because of the defocus blur. In this paper, we propose a multi-modal learning method for robust light field depth estimation. We first excavate the internal spatial correlation by designing a context reasoning unit which separately extracts comprehensive contextual information from the focal stack and RGB images. Then we integrate the contextual information by exploiting a attention-guide cross-modal fusion module. Extensive experiments demonstrate that our method achieves superior performance than existing representative methods on two light field datasets. Moreover, visual results on a mobile phone dataset show that our method can be widely used in daily life.
نوع الوثيقة:	Working Paper
URL الوصول:	http://arxiv.org/abs/2104.05971
رقم الأكسشن:	edsarx.2104.05971
قاعدة البيانات:	arXiv

الوصف
الوصف غير متاح.