Mood Classification Using Listening Data

التفاصيل البيبلوغرافية
العنوان: Mood Classification Using Listening Data
المؤلفون: Korzeniowski, Filip, Nieto, Oriol, McCallum, Matthew, Won, Minz, Oramas, Sergio, Schmidt, Erik
سنة النشر: 2020
المجموعة: Computer Science
مصطلحات موضوعية: Computer Science - Sound, Computer Science - Information Retrieval, Electrical Engineering and Systems Science - Audio and Speech Processing
الوصف: The mood of a song is a highly relevant feature for exploration and recommendation in large collections of music. These collections tend to require automatic methods for predicting such moods. In this work, we show that listening-based features outperform content-based ones when classifying moods: embeddings obtained through matrix factorization of listening data appear to be more informative of a track mood than embeddings based on its audio content. To demonstrate this, we compile a subset of the Million Song Dataset, totalling 67k tracks, with expert annotations of 188 different moods collected from AllMusic. Our results on this novel dataset not only expose the limitations of current audio-based models, but also aim to foster further reproducible research on this timely topic.
Comment: Appears in Proc. of the International Society for Music Information Retrieval Conference 2020 (ISMIR 2020)
نوع الوثيقة: Working Paper
URL الوصول: http://arxiv.org/abs/2010.11512
رقم الأكسشن: edsarx.2010.11512
قاعدة البيانات: arXiv