AES Convention Papers Forum

Real-Time Reverb Reduction for Improved Automatic Speech Recognition in Far-Field

This paper presents methods for real-time reverb reduction based on Generalized Weighted Prediction Error (GWPE). The proposed audio processing routines substantially improve the accuracy of an Automatic Speech Recognition (ASR) system: word error rates (WERs) are reduced by 11.36% when the user stands 5 meters from the microphone array. This result is close to the one achieved by the offline GWPE implementation (12.06%). Thanks to optimizations and parameter tuning, the computational complexity of the proposed GWPE realization was greatly reduced, and it achieves real-time factors (RTFs) lower than 1.0 (computation time shorter than signal duration) on a single CPU core.
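
The RTF criterion mentioned in the abstract can be illustrated with a minimal sketch. It assumes the usual definition of RTF as processing time divided by audio duration; the names `real_time_factor` and `measure_rtf` and the `process` callable are hypothetical stand-ins and do not come from the paper, which does not describe its measurement code.

```python
import time


def real_time_factor(processing_seconds, audio_seconds):
    """Return processing time divided by audio duration (assumed RTF definition)."""
    if audio_seconds <= 0:
        raise ValueError("audio duration must be positive")
    return processing_seconds / audio_seconds


def measure_rtf(process, samples, sample_rate):
    """Time one call of a hypothetical `process` routine over `samples`.

    `process` is a placeholder for any dereverberation routine; it is not
    the GWPE implementation from the paper.
    """
    start = time.perf_counter()
    process(samples)
    elapsed = time.perf_counter() - start
    # Audio duration in seconds is the sample count over the sample rate.
    return real_time_factor(elapsed, len(samples) / sample_rate)


if __name__ == "__main__":
    one_second = [0.0] * 16000  # one second of silence at an assumed 16 kHz
    rtf = measure_rtf(lambda x: None, one_second, 16000)
    print("real-time capable" if rtf < 1.0 else "slower than real time")
```

An RTF below 1.0 means the routine finishes before the audio it processed would have finished playing, which is the condition the abstract reports for a single CPU core.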

Authors: Kupryjanow, Adam; Maziewski, Przemyslaw; Kurylo, Lukasz; Lasota, Piotr
Affiliation: Intel Technology Poland, Gdansk, Poland
AES Convention: 142 (May 2017) Paper Number: 9803
Publication Date: May 11, 2017
Subject: Miscellaneous 2