This paper explores the feasibility of using synchronization of speech mixtures prior to blind sparse source separation methods in order to improve their results. Broadly, methods that assume sparse sources use level and phase differences between mixtures as their features, and they separate signals from them. If each mixture is considerably delayed with respect to the rest of them, the information extracted from these differences can be wrong. With this idea in mind, this paper will focus on using Time Delay Estimation algorithms in order to synchronize the mixtures and observing the improvement that it provokes in a Blind Sparse Source Separation algorithm. The results obtained show the feasibility of using synchronization of the speech mixtures.
Authors:
Llerena, Cosme; Álvarez, Lorena; Gil-Pita, Roberto; Rosa-Zurera, Manuel
Affiliation:
University of Alcalá, Alcala de Henares (Madrid), Spain
AES Convention:
134 (May 2013)
Paper Number:
8834
Publication Date:
May 4, 2013
Subject:
Speech Processing
Click to purchase paper as a non-member or you can login as an AES member to see more options.
No AES members have commented on this paper yet.
To be notified of new comments on this paper you can subscribe to this RSS feed. Forum users should login to see additional options.
If you are not yet an AES member and have something important to say about this paper then we urge you to join the AES today and make your voice heard. You can join online today by clicking here.