Perceptual Evaluation on Audio-visual Dataset of 360 Content

Bibliographic Details
Title: Perceptual Evaluation on Audio-visual Dataset of 360 Content
Authors: Fela, Randy F, Pastor, Andréas, Callet, Patrick Le, Zacharov, Nick, Vigier, Toinon, Forchhammer, Søren
Publication Year: 2022
Collection: Computer Science
Subject Terms: Computer Science - Multimedia, Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing, Electrical Engineering and Systems Science - Image and Video Processing
Description: To open up new possibilities to assess the multimodal perceptual quality of omnidirectional media formats, we proposed a novel open-source 360 audiovisual (AV) quality dataset. The dataset consists of high-quality 360 video clips in equirectangular (ERP) format and higher-order ambisonics (4th order) along with the subjective scores. Three subjective quality experiments were conducted for audio, video, and AV with the procedures detailed in this paper. Using the data from subjective tests, we demonstrated that this dataset can be used to quantify perceived audio, video, and audiovisual quality. The diversity and discriminability of subjective scores were also analyzed. Finally, we investigated how our dataset correlates with various objective quality metrics of audio and video. Evidence from the results of this study implies that the proposed dataset can benefit future studies on multimodal quality evaluation of 360 content.
Comment: 6 pages, 5 figures, International Conference on Multimedia and Expo 2022
Document Type: Working Paper
Access URL: http://arxiv.org/abs/2205.08007
Accession Number: edsarx.2205.08007
Database: arXiv