Spatial audio coding (SAC) is a process to represent multichannel audio signals as down-mixed mono or stereo signals with spatial cues. The main strength of SAC is the significant bit-rate reduction while maintaining the perceptual sound quality. Binaural cue coding (BCC) has been introduced and now becomes an important scheme for multichannel SAC both in the sense of audio coding and the standardization issue in the MPEG. However, interchannel level difference (ICLD), one of the essential spatial cues for SAC, has a limitation that the quantized ICLD for transmission may lead to the sound quality degradation of decoded signal. In this paper, we propose virtual source location information (VSLI), which is an angle representing geometric spatial information between channels on playback layout, instead of the ICLD, and also a VSLI-based SAC system. Since a human being can not easily distinguish the variation of the spatial angle within the three degree distortion, the spatial angle, hence the VSLI, can be approximated discretely with the three degree resolution while maintaining the perceptual quality of output signals. The objective and subjective assessment results of our proposed system confirm superior performance to the ICLD-based SAC system.
Authors:
Jang, Inseon; Kang, Kyeongok; Seo, Jeongil
Affiliation:
ETRI
AES Convention:
119 (October 2005)
Paper Number:
6576
Publication Date:
October 1, 2005
Subject:
Audio Coding & Loudspeakers & Hi Resolution Audio
Click to purchase paper as a non-member or you can login as an AES member to see more options.
No AES members have commented on this paper yet.
To be notified of new comments on this paper you can subscribe to this RSS feed. Forum users should login to see additional options.
If you are not yet an AES member and have something important to say about this paper then we urge you to join the AES today and make your voice heard. You can join online today by clicking here.