The performance of perceptual audio coders depends on the efficiency of the quantization operation in masking the quantization noise under the audio signal. This objective is better addressed by coding separately different signal components such as sinusoids, transients and stationary noise. In this paper we use an audio coder that normalizes the MDCT spectrum by a smooth spectral envelope and by periodicities due to sinusoids. The resulting flattened MDCT coefficients are shown to exhibit a probability density function with small uncertainty allowing the design of an optimum non-uniform scalar quantizer. Its distortion--rate function is derived, is compared to that of of known quantizers, and compared to that obtained under real audio coding conditions.
Author:
Ferreira, Anibal J. S.
Affiliation:
School of Engineering of the University of Porto (FEUP-DEEC) / INESE Porto, Porto, Portugal
AES Convention:
115 (October 2003)
Paper Number:
5988
Publication Date:
October 1, 2003
Session Subject:
Psychoacoustics; Audio Coding
Click to purchase paper as a non-member or you can login as an AES member to see more options.
No AES members have commented on this paper yet.
To be notified of new comments on this paper you can subscribe to this RSS feed. Forum users should login to see additional options.
If you are not yet an AES member and have something important to say about this paper then we urge you to join the AES today and make your voice heard. You can join online today by clicking here.