CHJ-WLSP : Annotation of 'Word List by Semantic Principles' Labels for the Corpus of Historical Japanese
العنوان: | CHJ-WLSP : Annotation of 'Word List by Semantic Principles' Labels for the Corpus of Historical Japanese |
---|---|
المؤلفون: | Asahara, Masayuki, Ikegami, Nao, Suzuki, Tai, Ichimura, Taro, Kondo, Asuko, Kato, Sachi, Yamazaki, Makoto |
المصدر: | Proceedings of the Second Workshop on Language Technologies for Historical and Ancient Languages(LT4HALA 2022). :31-37 |
بيانات النشر: | European Language Resources Association, 2022. |
سنة النشر: | 2022 |
مصطلحات موضوعية: | Historical Japanese, Word Sense Annotation |
الوصف: | application/pdf National Institute for Japanese Language and Linguistics / Tokyo University of Foreign Studies Saitama University University of Tokyo Kyoto Prefectural University Mejiro University National Institute for Japanese Language and Linguistics This article presents a word-sense annotation for the Corpus of Historical Japanese: a mashed-up Japanese lexicon based on the 'Word List by Semantic Principles' (WLSP). The WLSP is a large-scale Japanese thesaurus that includes 98,241 entries with syntactic and hierarchical semantic categories. The historical WLSP is also compiled for the words in ancient Japanese. We utilized a morpheme-word sense alignment table to extract all possible word sense candidates for each word appearing in the target corpus. Then, we manually disambiguated the word senses for 647,751 words in the texts from the 10th century to 1910. |
وصف الملف: | application/pdf |
اللغة: | English |
URL الوصول: | https://explore.openaire.eu/search/publication?articleId=jairo_______::b53581526332227cc130aabdcfa57339 https://repository.ninjal.ac.jp/records/3617 |
حقوق: | OPEN |
رقم الأكسشن: | edsair.jairo.........b53581526332227cc130aabdcfa57339 |
قاعدة البيانات: | OpenAIRE |
الوصف غير متاح. |