A method for scaling the level of the virtual center in an audio signal is proposed. The multichannel input signal is processed in the time-frequency domain by applying time- and frequency-dependent real-valued spectral weights that implicitly depend on the panning factors and the diffuseness of the input signal. Different methods to compute the spectral weights are presented and evaluated. In all cases, the spectral weights depend on the ratio of the power of a linear combination of the individual channel signals to the power of a passive downmix signal. Applications of the presented method are upmixing, stereophonic enhancement, dialogue enhancement, and pre-processing for semantic audio analysis.
Uhle, Christian; Habets, Emanuël
Affiliation: International Audio Laboratories Erlangen, Erlangen, Germany
AES Conference: 53rd International Conference: Semantic Audio (January 2014)
Paper Number: P2-7
Publication Date: January 27, 2014
Subject: Audio Source Separation
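The abstract describes real-valued spectral weights driven by a power ratio but gives no formulas, so the following is only a minimal sketch of that general idea for a stereo signal, not the weights proposed in the paper. It assumes the linear combination is the difference signal (left minus right), takes the passive downmix to be the channel sum, and uses a made-up linear mapping from the ratio to a gain. The names `center_weight`, `scale_center`, `center_gain`, and `eps` are hypothetical.

```python
def center_weight(x_left: complex, x_right: complex,
                  center_gain: float, eps: float = 1e-12) -> float:
    """Real-valued weight for one time-frequency bin (illustrative mapping only)."""
    # Power of a linear combination of the channels. Assumption: the difference
    # signal, which cancels anything panned exactly to the center.
    p_diff = abs(x_left - x_right) ** 2
    # Power of the passive downmix (plain sum of the channels).
    p_sum = abs(x_left + x_right) ** 2
    # The ratio is 0 for identical channels and about 1 for a hard-panned bin.
    ratio = p_diff / (p_sum + eps)
    # Hypothetical mapping: 1 means "fully center", 0 means "not center".
    centerness = 1.0 - min(ratio, 1.0)
    # Interpolate between unity gain and the requested center gain.
    return 1.0 + (center_gain - 1.0) * centerness


def scale_center(left, right, center_gain):
    """Apply the same real-valued weight to both channels, bin by bin."""
    weights = [center_weight(l, r, center_gain) for l, r in zip(left, right)]
    out_left = [w * l for w, l in zip(weights, left)]
    out_right = [w * r for w, r in zip(weights, right)]
    return out_left, out_right, weights


# Two bins: one panned to the center, one panned hard left.
left_bins = [1 + 0j, 1 + 0j]
right_bins = [1 + 0j, 0 + 0j]
out_l, out_r, w = scale_center(left_bins, right_bins, center_gain=0.5)
```

In this sketch the center-panned bin is attenuated by the full `center_gain`, while the hard-panned bin is left essentially untouched. In practice the inputs would be short-time Fourier transform coefficients, and the powers would typically be smoothed over time before the ratio is formed; both steps are omitted here.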