The paper presents new methods for the selection of sinusoids and transients components in hybrid sinusoidal modeling of speech/audio. The instantaneous harmonic parameters (magnitude, frequency and phase) are calculated as the result of the narrow band filtering of speech/audio. The frequency-modulated filters synthesis with the closed form impulse response has been proposed. The filter frequency bounds can be determined during the components frequency tracking and can be adjusted according to the fundamental frequency modulations. It can be implemented speech/audio harmonic/noise decomposition. The transient components modeling are presented by matching pursuit with frame-based psychoacoustic optimized wavelet packet dictionary. The choice of most relevant coefficients is based on maximizing the matching between the auditory excitation scalograms of original and modeled signals.
Petrovsky, Alexey; Azarov, Elias; Petrovsky, Alexander
Affiliations: Bialystok Technical University, Bialystok, Poland; Belarusian State University of Informatics and Radioelectronics, Minsk, Belarus(See document for exact affiliation information.)
AES Convention: 126 (May 2009) Paper Number: 7705
Publication Date: May 1, 2009
Subject: Audio for Telecommunications
No AES members have commented on this paper yet.
If you are not yet an AES member and have something important to say about this paper then we urge you to join the AES today and make your voice heard. You can join online today by clicking here.