An approach to refine and adapt an existing music sound source separation algorithm to speech enhancement is presented. The existing algorithm has the capability to extract music sources from stereo recordings using the position of the sources in the stereo field. Described in this paper is the ability of a modified Azimuth Discrimination and Resynthesis algorithm (m-ADRess) to enhance speech in the presence of noise using a two-microphone array. Also proposed is a novel extension to the algorithm, which enables further noise removal from speech based on elevation angle of arrival. Objective measures and an informal listening test of processed speech show the suitability of m-ADRess for cleaning noisy speech mixtures in an anechoic environment.
Authors:
Cahill, Niall; Cooney, Rory; Humphreys, Kenneth; Lawlor, Robert
Affiliation:
National University of Ireland, Maynooth
AES Convention:
121 (October 2006)
Paper Number:
6961
Publication Date:
October 1, 2006
Subject:
Computers & Mobile Audio
Click to purchase paper as a non-member or you can login as an AES member to see more options.
No AES members have commented on this paper yet.
To be notified of new comments on this paper you can subscribe to this RSS feed. Forum users should login to see additional options.
If you are not yet an AES member and have something important to say about this paper then we urge you to join the AES today and make your voice heard. You can join online today by clicking here.