AES Convention Papers Forum

Harmonic Representation and Auditory Model-Based Parametric Matching and Its Application in Speech/Audio Analysis

The paper presents new methods for selecting sinusoidal and transient components in hybrid sinusoidal modeling of speech/audio. The instantaneous harmonic parameters (magnitude, frequency, and phase) are calculated by narrow-band filtering of the speech/audio signal. A method is proposed for synthesizing frequency-modulated filters with a closed-form impulse response. The filter frequency bounds can be determined while tracking the component frequencies and can be adjusted according to the modulations of the fundamental frequency. This allows harmonic/noise decomposition of speech/audio to be implemented. Transient components are modeled by matching pursuit with a frame-based, psychoacoustically optimized wavelet packet dictionary. The most relevant coefficients are chosen by maximizing the match between the auditory excitation scalograms of the original and modeled signals.
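As a rough illustration of the narrow-band filtering idea only, the sketch below estimates the instantaneous magnitude, frequency, and phase of a single component by complex demodulation followed by a windowed-sinc lowpass filter. It is not the paper's method: the filter here has a fixed center frequency rather than the frequency-modulated, closed-form design described in the abstract, and the names `narrowband_harmonic`, `f_center`, `half_bw`, and `n_taps` are hypothetical choices made for this example.

```python
import numpy as np

def narrowband_harmonic(x, fs, f_center, half_bw, n_taps=257):
    """Estimate instantaneous magnitude, frequency (Hz), and phase (rad)
    of the component of x lying within f_center +/- half_bw.

    Assumption: a fixed-frequency band filter stands in for the paper's
    frequency-modulated filter, which is not reproduced here.
    """
    x = np.asarray(x, dtype=float)
    n = np.arange(len(x))
    carrier_phase = 2.0 * np.pi * f_center * n / fs

    # Shift the band of interest down to 0 Hz.
    baseband = x * np.exp(-1j * carrier_phase)

    # Symmetric (zero-phase) windowed-sinc lowpass with cutoff half_bw,
    # normalized to unit gain at 0 Hz.
    k = np.arange(n_taps) - (n_taps - 1) / 2.0
    h = np.sinc(2.0 * half_bw / fs * k) * np.hanning(n_taps)
    h /= h.sum()

    # The factor 2 restores the amplitude of a real sinusoid, whose energy
    # is split between positive and negative frequencies.
    envelope = 2.0 * np.convolve(baseband, h, mode="same")

    magnitude = np.abs(envelope)
    # Total phase = slowly varying baseband phase + carrier phase.
    phase = np.unwrap(np.angle(envelope)) + carrier_phase
    # Instantaneous frequency is the phase derivative, converted to Hz.
    frequency = np.gradient(phase) * fs / (2.0 * np.pi)
    return magnitude, frequency, phase
```

Applied to a steady tone that lies inside the filter band, the magnitude and frequency estimates should settle near the tone's amplitude and frequency once the filter is fully inside the signal; the first and last `n_taps` samples are affected by edge transients. Tracking a component whose frequency moves with the fundamental would require updating the filter band over time, which is the part the paper's frequency-modulated filters address.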

Authors: Petrovsky, Alexey; Azarov, Elias; Petrovsky, Alexander
Affiliations: Bialystok Technical University, Bialystok, Poland; Belarusian State University of Informatics and Radioelectronics, Minsk, Belarus (See document for exact affiliation information.)
AES Convention: 126 (May 2009) Paper Number: 7705
Publication Date: May 1, 2009
Subject: Audio for Telecommunications

