A Unified Representation Framework for the Evaluation of Optical Music Recognition Systems

التفاصيل البيبلوغرافية
العنوان: A Unified Representation Framework for the Evaluation of Optical Music Recognition Systems
المؤلفون: Torras, Pau, Biswas, Sanket, Fornés, Alicia
المصدر: International Journal on Document Analysis and Recognition (IJDAR), Volume 27, 2024, pp. 379-393
سنة النشر: 2023
المجموعة: Computer Science
مصطلحات موضوعية: Computer Science - Computer Vision and Pattern Recognition, I.4.9, J.5
الوصف: Modern-day Optical Music Recognition (OMR) is a fairly fragmented field. Most OMR approaches use datasets that are independent and incompatible between each other, making it difficult to both combine them and compare recognition systems built upon them. In this paper we identify the need of a common music representation language and propose the Music Tree Notation (MTN) format, with the idea to construct a common endpoint for OMR research that allows coordination, reuse of technology and fair evaluation of community efforts. This format represents music as a set of primitives that group together into higher-abstraction nodes, a compromise between the expression of fully graph-based and sequential notation formats. We have also developed a specific set of OMR metrics and a typeset score dataset as a proof of concept of this idea.
Comment: 18 pages, 4 figures, 3 tables, submitted (under review) for the International Journal in Document Analysis and Recognition
نوع الوثيقة: Working Paper
DOI: 10.1007/s10032-024-00485-8
URL الوصول: http://arxiv.org/abs/2312.12908
رقم الأكسشن: edsarx.2312.12908
قاعدة البيانات: arXiv
الوصف
DOI:10.1007/s10032-024-00485-8