The modified discrete cosine transform (MDCT) is often used for audio coding because of its critical sampling property and good energy compaction, especially for harmonic tones with a constant fundamental frequency (pitch). In voiced human speech, however, the pitch varies over time, so the energy is spread over several transform coefficients and coding efficiency is reduced. The approach presented herein compensates for pitch variation in each MDCT block by applying time-variant re-sampling. A dedicated signal-adaptive transform window computation ensures that the time-domain aliasing cancellation (TDAC) property is preserved. Re-sampling can be designed such that the duration of the processed blocks is not altered, which facilitates replacing the conventional MDCT in existing audio coders.
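The effect described in the abstract can be illustrated with a minimal sketch, which is not the authors' implementation. It assumes a single sinusoid whose pitch rises linearly within one block, a warp contour that is known exactly rather than estimated, a plain sine window, and linear interpolation for the re-sampling. The signal-adaptive window computation that preserves TDAC across overlapping blocks is not modeled. All function names and parameter values (`mdct`, `chirp`, `warp_resample`, `compaction`, `L`, `F0`, `A`) are hypothetical and chosen only for illustration.

```python
import math

def mdct(block):
    """Direct O(N^2) MDCT of a 2N-sample block using a sine window."""
    two_n = len(block)
    n_half = two_n // 2
    windowed = [math.sin(math.pi * (n + 0.5) / two_n) * block[n]
                for n in range(two_n)]
    return [
        sum(windowed[n]
            * math.cos(math.pi / n_half * (n + 0.5 + n_half / 2) * (k + 0.5))
            for n in range(two_n))
        for k in range(n_half)
    ]

def chirp(length, f0, a):
    """Sinusoid whose frequency rises linearly from f0 to f0*(1+a) cycles
    per block. Returns length+1 samples; the extra one is used only as the
    right-hand neighbour for interpolation."""
    return [math.sin(2 * math.pi * f0 * (u + a * u * u / 2))
            for u in (n / length for n in range(length + 1))]

def warp_resample(samples, length, a):
    """Re-sample at instants where the phase advances uniformly, so the
    pitch becomes constant. The output has the same number of samples as
    the input block, i.e. the block duration is unchanged."""
    out = []
    for m in range(length):
        # Solve u + a*u^2/2 = target for the normalized time u in [0, 1).
        target = (m / length) * (1 + a / 2)
        u = (-1 + math.sqrt(1 + 2 * a * target)) / a
        pos = u * length
        i = int(pos)
        frac = pos - i
        # Linear interpolation between neighbouring input samples.
        out.append((1 - frac) * samples[i] + frac * samples[i + 1])
    return out

def compaction(coeffs, top=4):
    """Fraction of the total energy held by the `top` largest coefficients."""
    energies = sorted((c * c for c in coeffs), reverse=True)
    return sum(energies[:top]) / sum(energies)

# Assumed example values: 256-sample block, pitch doubling within the block.
L, F0, A = 256, 8.0, 1.0
signal = chirp(L, F0, A)
plain = compaction(mdct(signal[:L]))
warped = compaction(mdct(warp_resample(signal, L, A)))
print(f"energy in 4 largest coefficients, plain MDCT:  {plain:.3f}")
print(f"energy in 4 largest coefficients, after warp:  {warped:.3f}")
```

Under these assumptions the warped block concentrates more of its energy in a few coefficients than the unwarped one, which is the compaction gain the abstract refers to. A real coder would additionally need to transmit the warp contour and adapt the windows so that aliasing still cancels between neighbouring blocks.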
Authors:
Edler, Bernd; Disch, Sascha; Bayer, Stefan; Fuchs, Guillaume; Geiger, Ralf
Affiliations:
Leibniz Universitaet Hannover, Hannover, Germany; Fraunhofer IIS, Erlangen, Germany (See document for exact affiliation information.)
AES Convention:
126 (May 2009)
Paper Number:
7710
Publication Date:
May 1, 2009
Subject:
Audio Coding