In this paper we study the problem of estimating the distance of a sound source from a single microphone recording in a room environment. The room effect cannot be separated from the problem without making assumptions about the properties of the source signal. Therefore, it is necessary to develop methods of distance estimation separately for different types of source signals. In this paper, we focus on speech signals. The proposed solution is to compute a number of statistical and source specific features from the speech signal and to use pattern recognition techniques to develop a robust distance estimator for speech signals. Experiments with a database of real speech recordings showed that the proposed model is capable of estimating source distance with acceptable performance for applications such as ambient telephony.
Georganti, Eleftheria; May, Tobias; van de Par, Steven; Härmä, Aki; Mourjopoulos, John
Affiliations: Philips Research Europe, Eindhoven, The Netherlands; University of Patras, Patras, Greece; Technische Universiteit Eindhoven, Eindhoven, The Netherlands(See document for exact affiliation information.)
AES Convention: 126 (May 2009) Paper Number: 7689
Publication Date: May 1, 2009
Subject: Spatial Audio Processing
No AES members have commented on this paper yet.
If you are not yet an AES member and have something important to say about this paper then we urge you to join the AES today and make your voice heard. You can join online today by clicking here.