The success of a perceptual audio coder depends strongly on the joint efficiency of three of its main building blocks: the analysis/synthesis filter bank, the psychoacoustic model, and the quantizer. This paper focuses on the combined choice of an appropriate filter bank and an accurate psychoacoustic model. A few statistical results concerning stationarity and harmonicity are used to discuss the objective and subjective coding gain, the need to address multiresolution, and the relevance of including aspects of pitch and timbre perception in perceptual modeling. Specific solutions are proposed.
Author: Ferreira, Anibal J. S.
Affiliation: FEUP/INESC, Porto, Portugal
AES Convention: 104 (May 1998)
Paper Number: 4671
Publication Date: May 1, 1998
Subject: Perceptual Coding