The localization of sound sources, and particularly speech, has a numerous number of applications to the industry. This has motivated a continuous effort in developing robust direction-of-arrival detection algorithms. Time difference of arrival-based methods, and particularly, generalized cross-correlation approaches have been widely investigated in acoustic signal processing. Once a probability function is obtained, indicating those directions of arrival with highest probability, the vast majority of methods have to assume a certain number of sound sources in order to process the information conveniently. In this paper, a model selection based on a Bayesian framework is proposed in order to determine, in an unsupervised way, how many sound sources are estimated together with the parameters estimation. Real measurements using two microphones are used to corroborate the proposed model.
Authors:
Escolano, José; Cobos, Máximo; Pérez-Lorenzo, Jose M.; López, José J.; Xiang, Ning
Affiliations:
Rensselaer Polytechnic Institute, Troy, NY, USA; Universidad Politécnica de Valencia, Valencia, Spain; University of Jaen, Linares, Spain, ; University of Jaén; University of Valencia, Valencia, Spain(See document for exact affiliation information.)
AES Convention:
132 (April 2012)
Paper Number:
8668
Publication Date:
April 26, 2012
Subject:
Spatial Audio
Click to purchase paper as a non-member or you can login as an AES member to see more options.
No AES members have commented on this paper yet.
To be notified of new comments on this paper you can subscribe to this RSS feed. Forum users should login to see additional options.
If you are not yet an AES member and have something important to say about this paper then we urge you to join the AES today and make your voice heard. You can join online today by clicking here.