دورية أكاديمية
LUMINA: Linguistic unified multimodal Indonesian natural audio-visual dataset
العنوان: | LUMINA: Linguistic unified multimodal Indonesian natural audio-visual dataset |
---|---|
المؤلفون: | Eka Rahayu Setyaningsih, Anik Nur Handayani, Wahyu Sakti Gunawan Irianto, Yosi Kristian, Christian Trisno Sen Long Chen |
المصدر: | Data in Brief, Vol 54, Iss , Pp 110279- (2024) |
بيانات النشر: | Elsevier, 2024. |
سنة النشر: | 2024 |
المجموعة: | LCC:Computer applications to medicine. Medical informatics LCC:Science (General) |
مصطلحات موضوعية: | Constrained audio-visual dataset, Lips reading, Speech synthesis, Face processing, Computer vision, Computer applications to medicine. Medical informatics, R858-859.7, Science (General), Q1-390 |
الوصف: | The LUMINA (Linguistic Unified Multimodal Indonesian Natural Audio-Visual) Dataset is a carefully curated constrained audio-visual dataset designed to support research in the field of speech perception. Spoken exclusively in Indonesian, LUMINA contains high-quality audio-visual recordings featuring 14 native speakers, including 9 males and 5 females. Each speaker contributes approximately 1,000 sentences, producing a rich and diverse data collection. The recorded videos focus on facial recordings, capturing essential visual cues and expressions that accompany speech. This extensive dataset provides a valuable resource for understanding how humans perceive and process spoken language, paving the way for speech recognition and synthesis technology advancements. |
نوع الوثيقة: | article |
وصف الملف: | electronic resource |
اللغة: | English |
تدمد: | 2352-3409 |
Relation: | http://www.sciencedirect.com/science/article/pii/S2352340924002488; https://doaj.org/toc/2352-3409 |
DOI: | 10.1016/j.dib.2024.110279 |
URL الوصول: | https://doaj.org/article/290a879896084e5e82ec23ab96d53eaa |
رقم الأكسشن: | edsdoj.290a879896084e5e82ec23ab96d53eaa |
قاعدة البيانات: | Directory of Open Access Journals |
تدمد: | 23523409 |
---|---|
DOI: | 10.1016/j.dib.2024.110279 |