This paper presents a new adaptive technique for capturing speech with microphone arrays in adverse conditions. The technique aligns the microphone signals, in the frequency domain, with the output of a fixed beamformer steered toward the target speaker. This alignment improves the directivity of the array pattern and reduces its sidelobes. Implementing the algorithms in the frequency domain keeps the computational complexity low, which makes the technique suitable for real-time forensic applications with a large number of microphones. The technique was evaluated on speech data corrupted by noise and interference at varying levels and from varying directions. Experimental results with an 8-microphone array show suppression of diffuse noise by about 23 dB and suppression of wideband interference by up to 17 dB.
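The fixed beamformer mentioned in the abstract can be illustrated with a minimal frequency-domain sketch. The abstract does not say which fixed beamformer the authors use, nor how the alignment step is computed, so the example below is only an assumption-laden illustration: it uses a plain delay-and-sum design for a uniform linear array in the far field, and every name in it (`steering_vector`, `delay_and_sum_weights`, `beam_response`, the 5 cm spacing, the 343 m/s speed of sound) is hypothetical rather than taken from the paper. The adaptive alignment itself is not shown.

```python
import numpy as np

SPEED_OF_SOUND = 343.0  # m/s; assumed value, not from the paper


def steering_vector(freq_hz, mic_positions_m, angle_rad):
    """Far-field steering vector for a linear array laid along one axis.

    angle_rad is measured from broadside (0 = perpendicular to the array).
    """
    delays = mic_positions_m * np.sin(angle_rad) / SPEED_OF_SOUND
    return np.exp(-2j * np.pi * freq_hz * delays)


def delay_and_sum_weights(freq_hz, mic_positions_m, look_angle_rad):
    """Delay-and-sum weights for one frequency bin (assumed fixed beamformer)."""
    d = steering_vector(freq_hz, mic_positions_m, look_angle_rad)
    return d / len(mic_positions_m)


def beam_response(weights, freq_hz, mic_positions_m, angle_rad):
    """Magnitude of the array response to a plane wave arriving from angle_rad."""
    d = steering_vector(freq_hz, mic_positions_m, angle_rad)
    # np.vdot conjugates its first argument, giving w^H d.
    return np.abs(np.vdot(weights, d))


# Hypothetical 8-microphone array with 5 cm spacing, evaluated at 2 kHz.
mics = np.arange(8) * 0.05
freq = 2000.0
w = delay_and_sum_weights(freq, mics, look_angle_rad=0.0)

on_target = beam_response(w, freq, mics, 0.0)
off_target = beam_response(w, freq, mics, np.deg2rad(60.0))
print(f"look direction: {on_target:.3f}, 60 degrees off: {off_target:.3f}")
```

The weights give unity gain toward the look direction and attenuate plane waves from other angles. In a scheme like the one the abstract describes, the output of such a beamformer would serve as the reference against which each microphone's spectrum is aligned.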
Authors:
Stolbov, Mikhail; Aleinik, Sergei
Affiliations:
National Research University of Information Technologies, Mechanics and Optics, St. Petersburg, Russia; Speech Technology Center, St. Petersburg, Russia (See document for exact affiliation information.)
AES Conference:
54th International Conference: Audio Forensics (June 2014)
Paper Number:
5-1
Publication Date:
June 12, 2014
Subject:
Speech Intelligibility