STraDa: A Singer Traits Dataset

التفاصيل البيبلوغرافية
العنوان: STraDa: A Singer Traits Dataset
المؤلفون: Kong, Yuexuan, Tran, Viet-Anh, Hennequin, Romain
سنة النشر: 2024
المجموعة: Computer Science
مصطلحات موضوعية: Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing
الوصف: There is a limited amount of large-scale public datasets that contain downloadable music audio files and rich lead singer metadata. To provide such a dataset to benefit research in singing voices, we created Singer Traits Dataset (STraDa) with two subsets: automatic-strada and annotated-strada. The automatic-strada contains twenty-five thousand tracks across numerous genres and languages of more than five thousand unique lead singers, which includes cross-validated lead singer metadata as well as other track metadata. The annotated-strada consists of two hundred tracks that are balanced in terms of 2 genders, 5 languages, and 4 age groups. To show its use for model training and bias analysis thanks to its metadata's richness and downloadable audio files, we benchmarked singer sex classification (SSC) and conducted bias analysis.
نوع الوثيقة: Working Paper
URL الوصول: http://arxiv.org/abs/2406.04140
رقم الأكسشن: edsarx.2406.04140
قاعدة البيانات: arXiv