Total Least Squares (TLS) algorithms automatically decompose (audio) frames into a number of exponentially damped sinusoids. This can provide for more efficient modeling than plain sinusoidal modeling, especially in the case of transitional frames. Straightforward implementations of TLS optimize a SNR criterion. In our implementation we apply TLS in a subband scheme in which the number of damped sinusoids is both frame and subband dependent. This is made possible through the use of perceptual information provided by the MPEG-I psycho-acoustic model I. Experiments on different audio tracks provide proof of concept for our perceptual ESM, and illustrate the significant reduction in modeling components compared to a non-perceptual ESM.
Authors:
Hermus, Kris; Verhelst, Werner; Wambacq, Patrick
Affiliations:
Katholieke Universiteit Leuven, dept. ESAT - div. PSI, Leuven, BELGIUM ; Vrije Universiteit Brussel, dept. ETRO - div. DSSP, Brussels, BELGIUM(See document for exact affiliation information.)
AES Convention:
112 (April 2002)
Paper Number:
5571
Publication Date:
April 1, 2002
Subject:
Low Bit-Rate Audio Coding
Click to purchase paper as a non-member or you can login as an AES member to see more options.
No AES members have commented on this paper yet.
To be notified of new comments on this paper you can subscribe to this RSS feed. Forum users should login to see additional options.
If you are not yet an AES member and have something important to say about this paper then we urge you to join the AES today and make your voice heard. You can join online today by clicking here.