This paper describes a speech enhancement system based on a multichannel Wiener filter (MWF) embedded in a generalized sidelobe canceller (GSC). Instead of a fixed beamformer, the minimum variance distortionless response (MVDR) beamformer is used in the signal extracting path. Noise and reverberation covariance matrices are estimated for the MVDR beamformer. The noise power spectral density (PSD) is estimated in light of the spherically isotropic noise model and a blocking-based least-squaresmethod. The reverberation covariance matrix is established using a variance normalization delay linear prediction (NDLP) algorithm. To further reduce residual noise at the GSC output, a postfilter is cascaded. We compare the proposed enhancement system with five baseline methods, the integrated sidelobe cancellation and the linear prediction Kalman filter (ISCLP), the two-stage beamforming approach (TSBA), the blocking-based multichannel Wiener filter (BMWF), the GSC beamformer with a delay-and-sum (DS) as its fixed beamformer and a Wiener postfilter (DS-GSC-PF), and the GSC beamformer with a superdirective beamformer as its fixed beamformer and a Wiener postfilter (SB-GSC-PF). The experiment results have demonstrated that the proposed approach outperforms the ISCLP, TSBA, BMWF, DS-GSC-PF, and SB-GSC-PF methods in terms of objective quality evaluations.
Bai, Mingsian R.; Kung, Fan-Jie
Affiliations: Department of Electrical Engineering, National Tsing Hua University, Hsinchu 300044, Taiwan; Department of Electrical Engineering, National Tsing Hua University, Hsinchu 300044, Taiwan(See document for exact affiliation information.)
JAES Volume 70 Issue 3 pp. 140-155; March 2022
Publication Date: March 8, 2022
No AES members have commented on this paper yet.
If you are not yet an AES member and have something important to say about this paper then we urge you to join the AES today and make your voice heard. You can join online today by clicking here.