Non-Intrusive Speech Quality Assessment withAttention-based ResNet-BiLSTM

التفاصيل البيبلوغرافية
العنوان: Non-Intrusive Speech Quality Assessment withAttention-based ResNet-BiLSTM
المؤلفون: Kailai Shen, Diqun Yan, Zhe Ye, Xianbo Xu, JinXing Gao, Li Dong, Chengbin Peng, Kun Yang
بيانات النشر: Research Square Platform LLC, 2022.
سنة النشر: 2022
الوصف: Speech quality is frequently affected by a variety factors in online conferencing applications, such as background noise, reverberation, packet loss, network jitter and so on. In real scenarios, it is impossible to obtain a clean reference signal to evaluating the quality of the conferencing speech. Therefore, an effective non-intrusive speech quality (NISQA) method is necessary. In this paper, we propose a new network framework for NISQA based on ResNet and BiLSTM. ResNet is utlized to extract local features, while BiLSTM is used to integrate representative features with long-term time dependencies and sequential characteristics. Considering that ResNet may result in the loss of context information when applied to the NISQA task, we propose a variant of ResNet which can preserve the time series information of the conferencing speech. The experimental results demonstrate that the proposed method has a high correlation with the mean opinion score (MOS) of clean, noisy and processed speech.
URL الوصول: https://explore.openaire.eu/search/publication?articleId=doi_________::34773aa1ce42f7f961b9d2f4b5674ef9
https://doi.org/10.21203/rs.3.rs-2170880/v1
حقوق: OPEN
رقم الأكسشن: edsair.doi...........34773aa1ce42f7f961b9d2f4b5674ef9
قاعدة البيانات: OpenAIRE