AES Journal Forum

A Frequency-Domain Approach to Multichannel Upmix

Document Thumbnail

A series of upmixing techniques for generating multichannel audio from stereo recordings are proposed. The techniques use a common analysis framework based on a comparison between the short-time Fourier transforms of the left and right stereo signals. An interchannel coherence measure is used to identify time-frequency regions consisting mostly of ambience components, which can then be weighted via a nonlinear mapping function, and extracted to synthesize ambience signals. A similarity measure is used to identify the panning coefficients of the various sources in the mix in the time-frequency plane, and different heuristic mapping functions are applied to unmix (extract) one or more sources, and perceptually based functions to repan the signals into an arbitrary number of channels. We illustrate the application of the various techniques in the design of a two-to-five channel upmix system.

JAES Volume 52 Issue 7/8 pp. 740-749; July 2004
Publication Date:

Click to purchase paper as a non-member or you can login as an AES member to see more options.

No AES members have commented on this paper yet.

Subscribe to this discussion

RSS Feed To be notified of new comments on this paper you can subscribe to this RSS feed. Forum users should login to see additional options.

Start a discussion!

If you would like to start a discussion about this paper and are an AES member then you can login here:

If you are not yet an AES member and have something important to say about this paper then we urge you to join the AES today and make your voice heard. You can join online today by clicking here.

AES - Audio Engineering Society