In the context of telephony applications or movie sound recording, speech reverberation is problematic as it yields to a drop in both intelligibility and quality of the captured sound. Blind dereverberation is desirable, since the room impulse response is not known apriori or is changing unexpectedly. The use of higher order statistics (HOS) has proven effective in various areas of speech and signal processing, whenever dealing with a mixture of Gaussian and non-Gaussian processes or system non-linearity. The present paper extends on and offers a new variation of some of the existing work in the area of speech dereverberation using a Kurtosis maximization approach. The contribution of this work is in formulating the problem in the complex subband domain, based on new derived properties of the HOS, in a way that does not require an additional LPC analysis and residual filtering as with other methods. A complex normalized version of the kurtosis is used, as well as a variable adaptation factor based on the statistics of the error. The resulting scheme is effective in a number of tested scenarios and may potentially be extended to other audio signals.
Author:
Nemer, Elias J.
Affiliation:
DTS, Calabasas, CA, USA
AES Conference:
60th International Conference: DREAMS (Dereverberation and Reverberation of Audio, Music, and Speech) (January 2016)
Paper Number:
7-1
Publication Date:
January 27, 2016
Subject:
Paper Session 7
Click to purchase paper as a non-member or you can login as an AES member to see more options.
No AES members have commented on this paper yet.
To be notified of new comments on this paper you can
subscribe to this RSS feed.
Forum users should login to see additional options.
If you are not yet an AES member and have something important to say about this paper then we urge you to join the AES today and make your voice heard. You can join online today by clicking here.