Blind Acoustic Parameter Estimation Through Task-Agnostic Embeddings Using Latent Approximations

التفاصيل البيبلوغرافية
العنوان: Blind Acoustic Parameter Estimation Through Task-Agnostic Embeddings Using Latent Approximations
المؤلفون: Götz, Philipp, Tuna, Cagdas, Brendel, Andreas, Walther, Andreas, Habets, Emanuël A. P.
سنة النشر: 2024
مصطلحات موضوعية: Electrical Engineering and Systems Science - Audio and Speech Processing
الوصف: We present a method for blind acoustic parameter estimation from single-channel reverberant speech. The method is structured into three stages. In the first stage, a variational auto-encoder is trained to extract latent representations of acoustic impulse responses represented as mel-spectrograms. In the second stage, a separate speech encoder is trained to estimate low-dimensional representations from short segments of reverberant speech. Finally, the pre-trained speech encoder is combined with a small regression model and evaluated on two parameter regression tasks. Experimentally, the proposed method is shown to outperform a fully end-to-end trained baseline model.
Comment: Accepted for publication at IWAENC 2024
نوع الوثيقة: Working Paper
URL الوصول: http://arxiv.org/abs/2407.19989
رقم الأكسشن: edsarx.2407.19989
قاعدة البيانات: arXiv