دورية أكاديمية

Evaluating Robustness to Noise and Compression of Deep Neural Networks for Keyword Spotting

التفاصيل البيبلوغرافية
العنوان: Evaluating Robustness to Noise and Compression of Deep Neural Networks for Keyword Spotting
المؤلفون: Pedro H. Pereira, Wesley Beccaro, Miguel A. Ramirez
المصدر: IEEE Access, Vol 11, Pp 53224-53236 (2023)
بيانات النشر: IEEE, 2023.
سنة النشر: 2023
المجموعة: LCC:Electrical engineering. Electronics. Nuclear engineering
مصطلحات موضوعية: Speech recognition, machine learning algorithms, speech analysis, spectral analysis, pruning, quantization, Electrical engineering. Electronics. Nuclear engineering, TK1-9971
الوصف: Keyword Spotting (KWS) has been the subject of research in recent years given the increase of embedded systems for command recognition such as Alexa, Google Home, and Siri. Performance, model size, processing time, and robustness to noise are fundamental in these systems. Furthermore, applications in embedded systems demand computationally efficient models that can be implemented in current technology. In this work, an approach for keyword recognition is evaluated using three deep learning models namely LeNet-5, SqueezeNet, and EfficientNet-B0. We evaluate transfer learning, pruning and quantization strategies in training and test using noisy and clean speech signals. In addition, compression techniques such as pruning and quantization were assessed in terms of the size reduction of the model footprint and the accuracy obtained in each case. Using the Google’s Speech Commands dataset and additive babble noise signal, our keyword recognition approach achieves an accuracy of 94.6% using an unstructured pruning of 80% of the parameters of the original SqueezeNet network with a reduction of 70% in the model size.
نوع الوثيقة: article
وصف الملف: electronic resource
اللغة: English
تدمد: 2169-3536
Relation: https://ieeexplore.ieee.org/document/10136724/; https://doaj.org/toc/2169-3536
DOI: 10.1109/ACCESS.2023.3280477
URL الوصول: https://doaj.org/article/2736045f2f054af3b7cfb9edd725d9d7
رقم الأكسشن: edsdoj.2736045f2f054af3b7cfb9edd725d9d7
قاعدة البيانات: Directory of Open Access Journals
الوصف
تدمد:21693536
DOI:10.1109/ACCESS.2023.3280477