دورية أكاديمية

Vision-Based Assistance for Vocal Fold Identification in Laryngoscopy with Knowledge Distillation.

التفاصيل البيبلوغرافية
العنوان: Vision-Based Assistance for Vocal Fold Identification in Laryngoscopy with Knowledge Distillation.
المؤلفون: Thao Thi Phuong DAO, Minh-Khoi PHAM, Mai-Khiem TRAN, Chanh Cong Ha, Boi Ngoc VAN, Bich Anh TRAN, Minh-Triet TRAN
المصدر: Studies in Health Technology & Informatics; 2023, Vol. 310, p946-950, 5p
مستخلص: Laryngoscopy images play a vital role in merging computer vision and otorhinolaryngology research. However, limited studies offer laryngeal datasets for comparative evaluation. Hence, this study introduces a novel dataset focusing on vocal fold images. Additionally, we propose a lightweight network utilizing knowledge distillation, with our student model achieving around 98.4% accuracy-comparable to the original EfficientNetB1 while reducing model weights by up to 88%. We also present an AI-assisted smartphone solution, enabling a portable and intelligent laryngoscopy system that aids laryngoscopists in efficiently targeting vocal fold areas for observation and diagnosis. To sum up, our contribution includes a laryngeal image dataset and a compressed version of the efficient model, suitable for handheld laryngoscopy devices. [ABSTRACT FROM AUTHOR]
Copyright of Studies in Health Technology & Informatics is the property of IOS Press and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
قاعدة البيانات: Complementary Index
الوصف
تدمد:09269630
DOI:10.3233/SHTI231104