دورية أكاديمية
Databases on the Indonesian Prefixes PE- and PEN
العنوان: | Databases on the Indonesian Prefixes PE- and PEN |
---|---|
المؤلفون: | Karlina Denistia |
المصدر: | Journal of Language and Literature, Vol 23, Iss 1, Pp 13-24 (2023) |
بيانات النشر: | Prodi Sastra Inggris Fakultas Sastra Universitas Sanata Dharma, 2023. |
سنة النشر: | 2023 |
المجموعة: | LCC:Language. Linguistic theory. Comparative grammar |
مصطلحات موضوعية: | corpus data, morphology, prefixes, cosine similarity, Language. Linguistic theory. Comparative grammar, P101-410 |
الوصف: | This paper provides the theoretical grounding in constituting databases related to PE- and PEN-, two Indonesian nominalizing prefixes, which have various meanings (e.g., patient, agent, or instrument). The first database contains the words with PE- and PEN- whereas the second database provides the cosine similarity between two words of interest. Using a written Indonesian corpus as the primary source (Leipzig Corpora Collection), the databases contain the following information: PE- or PEN- prefixes, allomorph of PEN-, base word, semantics role, morphological variation, cosine similarity, as well as the word frequency. Furthermore, this paper elaborates the theoretical consideration on how each information was cultivated. In building the databases, Indonesian morphological parser and Word to Vector were used to analyze the Indonesian morphological status and to put the words in the corpus into a vector. In addition, manual verification for the data against the Indonesian comprehensive dictionary was also conducted. In the end, the databases are available for free so that the data could be used as materials for a corpus-based analysis on Indonesian morphology. This research shed light to a careful and thorough classification of the open-access databases of PE- and PEN- from their allomorphs, base word, semantics role, and morphological variation. The information provided in this article is hoped to be contributive in Indonesian morphology specifically, and other linguistics fields (e.g., corpus linguistics and quantitative linguistics) in general. |
نوع الوثيقة: | article |
وصف الملف: | electronic resource |
اللغة: | English |
تدمد: | 1410-5691 2580-5878 |
Relation: | https://e-journal.usd.ac.id/index.php/JOLL/article/view/4967; https://doaj.org/toc/1410-5691; https://doaj.org/toc/2580-5878 |
DOI: | 10.24071/joll.v23i1.4967 |
URL الوصول: | https://doaj.org/article/e46674466e5c48e5bfc463417e42d8e9 |
رقم الأكسشن: | edsdoj.46674466e5c48e5bfc463417e42d8e9 |
قاعدة البيانات: | Directory of Open Access Journals |
تدمد: | 14105691 25805878 |
---|---|
DOI: | 10.24071/joll.v23i1.4967 |