Phonetisaurus: Exploring grapheme-to-phoneme conversion with joint n-gram models in the WFST framework

التفاصيل البيبلوغرافية
العنوان: Phonetisaurus: Exploring grapheme-to-phoneme conversion with joint n-gram models in the WFST framework
المؤلفون: Keikichi Hirose, Nobuaki Minematsu, Josef R. Novak
المصدر: Natural Language Engineering. 22:907-938
بيانات النشر: Cambridge University Press (CUP), 2015.
سنة النشر: 2015
مصطلحات موضوعية: Linguistics and Language, Sequence, Computer science, business.industry, Grapheme, 02 engineering and technology, Machine learning, computer.software_genre, 01 natural sciences, Ensemble learning, Language and Linguistics, Range (mathematics), n-gram, Artificial Intelligence, Simple (abstract algebra), 0103 physical sciences, 0202 electrical engineering, electronic engineering, information engineering, 020201 artificial intelligence & image processing, Artificial intelligence, Focus (optics), business, Joint (audio engineering), 010301 acoustics, computer, Software
الوصف: This paper provides an analysis of several practical issues related to the theory and implementation of Grapheme-to-Phoneme (G2P) conversion systems utilizing the Weighted Finite-State Transducer paradigm. The paper addresses issues related to system accuracy, training time and practical implementation. The focus is on joint n-gram models which have proven to provide an excellent trade-off between system accuracy and training complexity. The paper argues in favor of simple, productive approaches to G2P, which favor a balance between training time, accuracy and model complexity. The paper also introduces the first instance of using joint sequence RnnLMs directly for G2P conversion, and achieves new state-of-the-art performance via ensemble methods combining RnnLMs and n-gram based models. In addition to detailed descriptions of the approach, minor yet novel implementation solutions, and experimental results, the paper introducesPhonetisaurus, a fully-functional, flexible, open-source, BSD-licensed G2P conversion toolkit, which leverages the OpenFst library. The work is intended to be accessible to a broad range of readers.
تدمد: 1469-8110
1351-3249
URL الوصول: https://explore.openaire.eu/search/publication?articleId=doi_________::9f8f540d9a6a494ee3368826fb9c8df4
https://doi.org/10.1017/s1351324915000315
حقوق: CLOSED
رقم الأكسشن: edsair.doi...........9f8f540d9a6a494ee3368826fb9c8df4
قاعدة البيانات: OpenAIRE