دورية أكاديمية

Similarity of Musical Timbres Using FFT-Acoustic Descriptor Analysis and Machine Learning

التفاصيل البيبلوغرافية
العنوان: Similarity of Musical Timbres Using FFT-Acoustic Descriptor Analysis and Machine Learning
المؤلفون: Yubiry Gonzalez, Ronaldo C. Prati
المصدر: Eng, Vol 4, Iss 1, Pp 555-568 (2023)
بيانات النشر: MDPI AG, 2023.
سنة النشر: 2023
المجموعة: LCC:Electrical engineering. Electronics. Nuclear engineering
مصطلحات موضوعية: musical timbre, FFT, musical instruments, acoustic descriptors, machine learning, data analysis, Electrical engineering. Electronics. Nuclear engineering, TK1-9971
الوصف: Musical timbre is a phenomenon of auditory perception that allows the recognition of musical sounds. The recognition of musical timbre is a challenging task because the timbre of a musical instrument or sound source is a complex and multifaceted phenomenon that is influenced by a variety of factors, including the physical properties of the instrument or sound source, the way it is played or produced, and the recording and processing techniques used. In this paper, we explore an abstract space with 7 dimensions formed by the fundamental frequency and FFT-Acoustic Descriptors in 240 monophonic sounds from the Tinysol and Good-Sounds databases, corresponding to the fourth octave of the transverse flute and clarinet. This approach allows us to unequivocally define a collection of points and, therefore, a timbral space (Category Theory) that allows different sounds of any type of musical instrument with its respective dynamics to be represented as a single characteristic vector. The geometric distance would allow studying the timbral similarity between audios of different sounds and instruments or between different musical dynamics and datasets. Additionally, a Machine-Learning algorithm that evaluates timbral similarities through Euclidean distances in the abstract space of 7 dimensions was proposed. We conclude that the study of timbral similarity through geometric distances allowed us to distinguish between audio categories of different sounds and musical instruments, between the same type of sound and an instrument with different relative dynamics, and between different datasets.
نوع الوثيقة: article
وصف الملف: electronic resource
اللغة: English
تدمد: 2673-4117
Relation: https://www.mdpi.com/2673-4117/4/1/33; https://doaj.org/toc/2673-4117
DOI: 10.3390/eng4010033
URL الوصول: https://doaj.org/article/f461a82f13794e8e8191d4c4fbb4fb5e
رقم الأكسشن: edsdoj.f461a82f13794e8e8191d4c4fbb4fb5e
قاعدة البيانات: Directory of Open Access Journals
الوصف
تدمد:26734117
DOI:10.3390/eng4010033