In the application of conventional audio compression algorithms to low bit rate audio coding one is faced with the unsatisfactory tradeoff between coarser quantization and audio bandwidth reduction. BandwidthExtension has therefore emerged as an important tool for the satisfactory performance of low bit rate audio codecs. In this paper we describe one of a newer class of Frequency Extension techniques which are applied directly to the high frequency resolution representation of the signal (e.g., MDCT). This particular technique is based on a Fractal Self-Similarity Model (FSSM) for the short-term frequency representation of the signal and takes advantage of the high frequency resolution of the MDCT, namely in terms of parameter estimation.. The FSSM model, which may include multiple dilation and translation terms, has been found to be effective for a wide variety of speech and music signals and provides a compact description for long term correlation that may exist in frequency domain.. The Structure of the FSSM model is presented, issues related to parameter estimation, and its application to audio coding for bit rates of 8-48 kbps are discussed.
Authors:
Ferreira, Anibal J. S.; Sen, Deep; Sinha, Deepen
Affiliations:
ATC Labs; University of New South Wales/ATC Labs; University of Porto/ATC Labs(See document for exact affiliation information.)
AES Convention:
118 (May 2005)
Paper Number:
6467
Publication Date:
May 1, 2005
Subject:
Low Bit Rate Audio Coding (Research)
Click to purchase paper as a non-member or you can login as an AES member to see more options.
No AES members have commented on this paper yet.
To be notified of new comments on this paper you can subscribe to this RSS feed. Forum users should login to see additional options.
If you are not yet an AES member and have something important to say about this paper then we urge you to join the AES today and make your voice heard. You can join online today by clicking here.