Techniques where a stereo or a multichannel signal is decomposed into spatial source-labeled time-frequency slots by level, time-difference, and coherence metrics have become popular in recent years. Good examples are binaural cue coding and up/downmixing techniques. In the article, we will provide an overview and discuss parallel approaches in the field of array processing and blind source separation. Typically, time-frequency slots are formed from subband representations of signals. However, it is also possible to produce a similar spatial decomposition for a parametric representation (sinusoids, transients, and noise) of a stereo or multichannel audio signal. Advantages and disadvantages of the two approaches for audio coding applications are discussed in this article.
Authors:
Harma, Aki; Faller, Christof
Affiliations:
Laboratory of Acoustics and Audio Signal Processing, Helsinki University of Technology, HUT, Finland ; Mobile Terminals Division, Agere Systems, Allentown, PA,(See document for exact affiliation information.)
AES Convention:
116 (May 2004)
Paper Number:
6061
Publication Date:
May 1, 2004
Subject:
Low bit-Rate Audio Coding
Click to purchase paper as a non-member or you can login as an AES member to see more options.
No AES members have commented on this paper yet.
To be notified of new comments on this paper you can subscribe to this RSS feed. Forum users should login to see additional options.
If you are not yet an AES member and have something important to say about this paper then we urge you to join the AES today and make your voice heard. You can join online today by clicking here.