When the number of available loudspeakers or transmission channels is smaller than the number of channels in an audio format, downmixing is required. If the audio in the channels contains nonaligned interdependent sounds, the downmixed signal may have perceptible spectral biases, such as those produced by a comb filter. A time–frequency domain, phase-adaptive downmixing technique is proposed to reduce such spectral effects. The technique aligns the phases of the input channel pairs or groups having a high measured normalized interchannel coherence prior to the downmixing. Simulations and listening tests were conducted to show the conditions in which the proposed method provides a benefit over the legacy methods. Computational evaluations showed that the method may be implemented in real time for a large number of channels using reasonable hardware. The target for the phase processing is weighted with the input channel amplitudes, and the phase coefficients are regularized over time and frequency to avoid processing artifacts.
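The comb-filter effect described above, and the basic idea of aligning phases before summing, can be illustrated with a toy example. The following is a minimal sketch and not the authors' algorithm: it assumes a single FFT frame, two fully coherent channels that differ only by a circular delay, and it omits the coherence measurement, amplitude weighting, regularization over time and frequency, and any energy normalization. All function names and parameters (`passive_downmix`, `phase_aligned_downmix`, `demo`, `n`, `delay`, `seed`) are hypothetical.

```python
import numpy as np


def passive_downmix(x1, x2):
    """Plain sum of two channels (a simple legacy-style downmix)."""
    return x1 + x2


def phase_aligned_downmix(x1, x2):
    """Sum two channels after rotating x2's phase onto x1's in each bin.

    Toy single-frame version: no coherence gating, no amplitude
    weighting, no smoothing of the phase coefficients (all assumptions).
    """
    X1 = np.fft.rfft(x1)
    X2 = np.fft.rfft(x2)
    # The phase of the cross-spectrum is the per-bin phase difference.
    phase_diff = np.angle(X1 * np.conj(X2))
    X2_aligned = X2 * np.exp(1j * phase_diff)
    return np.fft.irfft(X1 + X2_aligned, n=len(x1))


def demo(n=1024, delay=8, seed=0):
    """Return magnitude spectra of the passive and aligned downmixes."""
    rng = np.random.default_rng(seed)
    x1 = rng.standard_normal(n)
    # Circular delay: the channels are coherent but not time-aligned.
    x2 = np.roll(x1, delay)
    passive = np.abs(np.fft.rfft(passive_downmix(x1, x2)))
    aligned = np.abs(np.fft.rfft(phase_aligned_downmix(x1, x2)))
    reference = 2 * np.abs(np.fft.rfft(x1))  # spectrum with no cancellation
    return passive, aligned, reference


if __name__ == "__main__":
    passive, aligned, reference = demo()
    notch_bin = 1024 // (2 * 8)  # first comb notch for this delay
    print("passive / reference at notch:", passive[notch_bin] / reference[notch_bin])
    print("aligned / reference at notch:", aligned[notch_bin] / reference[notch_bin])
```

With these assumed settings, the passive sum has deep notches at regularly spaced bins (the first at bin 64), whereas the phase-aligned sum follows the spectrum of a single channel scaled by two. A practical system would operate on short overlapping frames and apply the alignment only where the measured interchannel coherence is high, as the abstract states.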
Vilkamo, Juha; Kuntz, Achim; Füg, Simone
Affiliations: Aalto University, Espoo, Finland; Fraunhofer IIS, Erlangen, Germany (See document for exact affiliation information.)
JAES Volume 62 Issue 7/8 pp. 516-526; July 2014
Publication Date: August 22, 2014