Report
Blind Acoustic Parameter Estimation Through Task-Agnostic Embeddings Using Latent Approximations
Title: | Blind Acoustic Parameter Estimation Through Task-Agnostic Embeddings Using Latent Approximations |
---|---|
Authors: | Götz, Philipp; Tuna, Cagdas; Brendel, Andreas; Walther, Andreas; Habets, Emanuël A. P. |
Publication Year: | 2024 |
Subject Terms: | Electrical Engineering and Systems Science - Audio and Speech Processing |
Description: | We present a method for blind acoustic parameter estimation from single-channel reverberant speech. The method is structured into three stages. In the first stage, a variational auto-encoder is trained to extract latent representations of acoustic impulse responses represented as mel-spectrograms. In the second stage, a separate speech encoder is trained to estimate low-dimensional representations from short segments of reverberant speech. Finally, the pre-trained speech encoder is combined with a small regression model and evaluated on two parameter regression tasks. Experimentally, the proposed method is shown to outperform a fully end-to-end trained baseline model. Comment: Accepted for publication at IWAENC 2024 |
Document Type: | Working Paper |
Access URL: | http://arxiv.org/abs/2407.19989 |
Accession Number: | edsarx.2407.19989 |
Database: | arXiv |