MaiBaam Annotation Guidelines

التفاصيل البيبلوغرافية
العنوان: MaiBaam Annotation Guidelines
المؤلفون: Blaschke, Verena, Kovačić, Barbara, Peng, Siyao, Plank, Barbara
سنة النشر: 2024
المجموعة: Computer Science
مصطلحات موضوعية: Computer Science - Computation and Language
الوصف: This document provides the annotation guidelines for MaiBaam, a Bavarian corpus annotated with part-of-speech (POS) tags and syntactic dependencies. MaiBaam belongs to the Universal Dependencies (UD) project, and our annotations elaborate on the general and German UD version 2 guidelines. In this document, we detail how to preprocess and tokenize Bavarian data, provide an overview of the POS tags and dependencies we use, explain annotation decisions that would also apply to closely related languages like German, and lastly we introduce and motivate decisions that are specific to Bavarian grammar.
نوع الوثيقة: Working Paper
URL الوصول: http://arxiv.org/abs/2403.05902
رقم الأكسشن: edsarx.2403.05902
قاعدة البيانات: arXiv