EMVD dataset: a dataset of extreme vocal distortion techniques used in heavy metal

التفاصيل البيبلوغرافية
العنوان: EMVD dataset: a dataset of extreme vocal distortion techniques used in heavy metal
المؤلفون: Tailleur, Modan, Pinquier, Julien, Millot, Laurent, Vogel, Corsin, Lagrange, Mathieu
المصدر: 21st International Conference on Content-based Multimedia Indexing (CBMI), Gylfi {\TH}{\'o}r Gu{\dh}mundsson; Laurent Amsaleg; Omar Shahbaz Khan; Ralph Gasser; Shin'ichi Satoh; Maria Pegia; Aladine Chetouani; Bj{\"o}rn {\TH}{\'o}r J{\'o}nsson; Claudio Gennaro; Ewa Kijak; Ilias Gialampoukidis; Liting Zhou; Jenny Benois-Pineau; Stevan Rudinac, Sep 2024, Reykjavik, Iceland
سنة النشر: 2024
المجموعة: Computer Science
Physics (Other)
مصطلحات موضوعية: Computer Science - Sound, Computer Science - Artificial Intelligence, Physics - Classical Physics
الوصف: In this paper, we introduce the Extreme Metal Vocals Dataset, which comprises a collection of recordings of extreme vocal techniques performed within the realm of heavy metal music. The dataset consists of 760 audio excerpts of 1 second to 30 seconds long, totaling about 100 min of audio material, roughly composed of 60 minutes of distorted voices and 40 minutes of clear voice recordings. These vocal recordings are from 27 different singers and are provided without accompanying musical instruments or post-processing effects. The distortion taxonomy within this dataset encompasses four distinct distortion techniques and three vocal effects, all performed in different pitch ranges. Performance of a state-of-the-art deep learning model is evaluated for two different classification tasks related to vocal techniques, demonstrating the potential of this resource for the audio processing community.
نوع الوثيقة: Working Paper
URL الوصول: http://arxiv.org/abs/2406.17732
رقم الأكسشن: edsarx.2406.17732
قاعدة البيانات: arXiv