أعرض في EDS

Speaker-Conditioned Hierarchical Modeling for Automated Speech Scoring

التفاصيل البيبلوغرافية
العنوان:	Speaker-Conditioned Hierarchical Modeling for Automated Speech Scoring
المؤلفون:	Yaman Kumar Singla, Changyou Chen, Rajiv Ratn Shah, Balaji Krishnamurthy, Shaurya Bagga, Avyakt Gupta
المصدر:	CIKM
بيانات النشر:	arXiv, 2021.
سنة النشر:	2021
مصطلحات موضوعية:	FOS: Computer and information sciences, Computer Science - Machine Learning, Sound (cs.SD), Computer science, media_common.quotation_subject, Context (language use), computer.software_genre, Computer Science - Sound, Domain (software engineering), Task (project management), Machine Learning (cs.LG), Audio and Speech Processing (eess.AS), FOS: Electrical engineering, electronic engineering, information engineering, media_common, Computer Science - Computation and Language, Grammar, business.industry, Deep learning, Variable (computer science), Face (geometry), Artificial intelligence, business, Focus (optics), computer, Computation and Language (cs.CL), Natural language processing, Electrical Engineering and Systems Science - Audio and Speech Processing
الوصف:	Automatic Speech Scoring (ASS) is the computer-assisted evaluation of a candidate's speaking proficiency in a language. ASS systems face many challenges like open grammar, variable pronunciations, and unstructured or semi-structured content. Recent deep learning approaches have shown some promise in this domain. However, most of these approaches focus on extracting features from a single audio, making them suffer from the lack of speaker-specific context required to model such a complex task. We propose a novel deep learning technique for non-native ASS, called speaker-conditioned hierarchical modeling. In our technique, we take advantage of the fact that oral proficiency tests rate multiple responses for a candidate. We extract context vectors from these responses and feed them as additional speaker-specific context to our network to score a particular response. We compare our technique with strong baselines and find that such modeling improves the model's average performance by 6.92% (maximum = 12.86%, minimum = 4.51%). We further show both quantitative and qualitative insights into the importance of this additional context in solving the problem of ASS. Comment: Published in CIKM 2021
DOI:	10.48550/arxiv.2109.00928
URL الوصول:	https://explore.openaire.eu/search/publication?articleId=doi_dedup___::3efbed74975ed52ead5f0542adcd50a8
حقوق:	OPEN
رقم الأكسشن:	edsair.doi.dedup.....3efbed74975ed52ead5f0542adcd50a8
قاعدة البيانات:	OpenAIRE

الوصف
DOI:	10.48550/arxiv.2109.00928