In this paper we describe an efficient scheme for compression and flexible spatial rendering of audio signals. The method is based on Binaural Cue Coding (BCC) which was recently introduced for efficient compression of multi-channel audio signals. The encoder input consists of separate signals without directional spatial cues, such as separate sound source signals, i.e. several monophonic signals. The signal transmitted to the decoder consists of the mono sum-signal of all input signals plus a low bit rate (e.g. 2 kb/s) set of BCC parameters. The mono signal can be encoded with any conventional audio or speech coder. Using the BCC parameters and the mono signal, the BCC synthesizer can flexibly render a spatial image by determining the perceived direction of the audio content of each of the encoder input signals. We provide the results of an audio quality assessment using headphones, which is a more critical scenario than loudspeaker playback.
Authors:
Faller, Christof; Baumgarte, Frank
Affiliation:
Media Signal Processing Research, Agere Systems, Murray Hill, NJ
AES Convention:
113 (October 2002)
Paper Number:
5686
Publication Date:
October 1, 2002
Subject:
Low Bit-Rate Coding
Click to purchase paper as a non-member or you can login as an AES member to see more options.
No AES members have commented on this paper yet.
To be notified of new comments on this paper you can subscribe to this RSS feed. Forum users should login to see additional options.
If you are not yet an AES member and have something important to say about this paper then we urge you to join the AES today and make your voice heard. You can join online today by clicking here.