دورية أكاديمية

Evaluating the Language ENvironment Analysis System for Korean

التفاصيل البيبلوغرافية
العنوان: Evaluating the Language ENvironment Analysis System for Korean
اللغة: English
المؤلفون: McDonald, Margarethe (ORCID 0000-0002-9620-8556), Kwon, Taeahn, Kim, Hyunji, Lee, Youngki, Ko, Eon-Suk (ORCID 0000-0003-3963-4492)
المصدر: Journal of Speech, Language, and Hearing Research. Mar 2021 64(3):792-808.
الإتاحة: American Speech-Language-Hearing Association. 2200 Research Blvd #250, Rockville, MD 20850. Tel: 301-296-5700; Fax: 301-296-8580; e-mail: slhr@asha.org; Web site: http://jslhr.pubs.asha.org
Peer Reviewed: Y
Page Count: 17
تاريخ النشر: 2021
نوع الوثيقة: Journal Articles
Reports - Research
Descriptors: Computational Linguistics, Korean, Audio Equipment, Accuracy, Error Patterns, Classification, Measures (Individuals), Infants, Foreign Countries, Interrater Reliability, Recall (Psychology), Speech Evaluation, Databases, Correlation, Language Acquisition
مصطلحات جغرافية: South Korea
DOI: 10.1044/2020_JSLHR-20-00489
تدمد: 1092-4388
مستخلص: Purpose: The algorithm of the Language ENvironment Analysis (LENA) system for calculating language environment measures was trained on American English; thus, its validity with other languages cannot be assumed. This article evaluates the accuracy of the LENA system applied to Korean. Method: We sampled sixty 5-min recording clips involving 38 key children aged 7-18 months from a larger data set. We establish the identification error rate, precision, and recall of LENA classification compared to human coders. We then examine the correlation between standard LENA measures of adult word count, child vocalization count, and conversational turn count and human counts of the same measures. Results: Our identification error rate (64% or 67%), including false alarm, confusion, and misses, was similar to the rate found in Cristia, Lavechin, et al. (2020). The correlation between LENA and human counts for adult word count (r = 0.78 or 0.79) was similar to that found in the other studies, but the same measure for child vocalization count (r = 0.34-0.47) was lower than the value in Cristia, Lavechin, et al., though it fell within ranges found in other non-European languages. The correlation between LENA and human conversational turn count was not high (r = 0.36-0.47), similar to the findings in other studies. Conclusions: LENA technology is similarly reliable for Korean language environments as it is for other non-English language environments. Factors affecting the accuracy of diarization include speakers' pitch, duration of utterances, age, and the presence of noise and electronic sounds.
Abstractor: As Provided
Entry Date: 2021
رقم الأكسشن: EJ1294468
قاعدة البيانات: ERIC
الوصف
تدمد:1092-4388
DOI:10.1044/2020_JSLHR-20-00489