دورية أكاديمية

Databases on the Indonesian Prefixes PE- and PEN

التفاصيل البيبلوغرافية
العنوان: Databases on the Indonesian Prefixes PE- and PEN
المؤلفون: Karlina Denistia
المصدر: Journal of Language and Literature, Vol 23, Iss 1, Pp 13-24 (2023)
بيانات النشر: Prodi Sastra Inggris Fakultas Sastra Universitas Sanata Dharma, 2023.
سنة النشر: 2023
المجموعة: LCC:Language. Linguistic theory. Comparative grammar
مصطلحات موضوعية: corpus data, morphology, prefixes, cosine similarity, Language. Linguistic theory. Comparative grammar, P101-410
الوصف: This paper provides the theoretical grounding in constituting databases related to PE- and PEN-, two Indonesian nominalizing prefixes, which have various meanings (e.g., patient, agent, or instrument). The first database contains the words with PE- and PEN- whereas the second database provides the cosine similarity between two words of interest. Using a written Indonesian corpus as the primary source (Leipzig Corpora Collection), the databases contain the following information: PE- or PEN- prefixes, allomorph of PEN-, base word, semantics role, morphological variation, cosine similarity, as well as the word frequency. Furthermore, this paper elaborates the theoretical consideration on how each information was cultivated. In building the databases, Indonesian morphological parser and Word to Vector were used to analyze the Indonesian morphological status and to put the words in the corpus into a vector. In addition, manual verification for the data against the Indonesian comprehensive dictionary was also conducted. In the end, the databases are available for free so that the data could be used as materials for a corpus-based analysis on Indonesian morphology. This research shed light to a careful and thorough classification of the open-access databases of PE- and PEN- from their allomorphs, base word, semantics role, and morphological variation. The information provided in this article is hoped to be contributive in Indonesian morphology specifically, and other linguistics fields (e.g., corpus linguistics and quantitative linguistics) in general.
نوع الوثيقة: article
وصف الملف: electronic resource
اللغة: English
تدمد: 1410-5691
2580-5878
Relation: https://e-journal.usd.ac.id/index.php/JOLL/article/view/4967; https://doaj.org/toc/1410-5691; https://doaj.org/toc/2580-5878
DOI: 10.24071/joll.v23i1.4967
URL الوصول: https://doaj.org/article/e46674466e5c48e5bfc463417e42d8e9
رقم الأكسشن: edsdoj.46674466e5c48e5bfc463417e42d8e9
قاعدة البيانات: Directory of Open Access Journals
الوصف
تدمد:14105691
25805878
DOI:10.24071/joll.v23i1.4967