We describe a frequency-domain method for phase-amplitude matrix decoding and up-mixing of two-channel stereo recordings, based on spatial analysis of 2-D or 3-D directional and ambient cues in the recording, and re-synthesis of these cues for consistent reproduction over any loudspeaker or headphone playback system. The decoder is compatible with existing two-channel phase-amplitude stereo formats; however, unlike existing time-domain decoders, it preserves source separation and allows accurate reproduction of ambience and reverberation cues. The two-channel spatial encoding/decoding scheme is extended to incorporate 3-D elevation, without relying on HRTF cues. Applications include data-efficient storage or transmission of multi-channel soundtracks and computationally-efficient interactive audio spatialization in a backward-compatible stereo encoding format.
Authors:
Jot, Jean-Marc; Krishnaswami, Arvindh; Laroche, Jean; Merimaa, Juha; Goodwin, Michael M.
Affiliation:
Creative Advanced Technology Center
AES Convention:
123 (October 2007)
Paper Number:
7276
Publication Date:
October 1, 2007
Subject:
Signal Processing for 3-D Audio
Click to purchase paper as a non-member or you can login as an AES member to see more options.
No AES members have commented on this paper yet.
To be notified of new comments on this paper you can subscribe to this RSS feed. Forum users should login to see additional options.
If you are not yet an AES member and have something important to say about this paper then we urge you to join the AES today and make your voice heard. You can join online today by clicking here.