Community

AES Conference Papers Forum

Discrimination Module for Voice/Audio Signals Based on Wavelet Ridges Analysis

Document Thumbnail

"Low bit-rate at high quality perception is the aim of coding schemes. Traditionally audio and voice coders have evolved as different paradigms: the state of the art voice-coders cores resides in Algebraic Code Excited Linear Prediction (ACELP) technologies while audio coding has its core in Transform Coding (TC). The Unified Speech-Audio Coding (USAC) scheme has become a new paradigm where the principal goal is to choose between the ACELP or TC to reduce the bit rate and increase the high quality perception. This modern coder is based in a module that decides which core coder to use on a specific signal frame. This paper proposes a decision module based on ridges detection in the wavelet transform of the input signal. Wavelet ridges permit to track the instantaneous frequencies contained in the analyzed signal. These instantaneous frequencies, linked to the signal pitch and its harmonics, permit to establish a module for determining whether it is a voice signal or audio."

Authors:
Affiliation:
AES Conference:
Paper Number:
Publication Date:
Subject:

Click to purchase paper as a non-member or you can login as an AES member to see more options.

No AES members have commented on this paper yet.

Subscribe to this discussion

RSS Feed To be notified of new comments on this paper you can subscribe to this RSS feed. Forum users should login to see additional options.

Start a discussion!

If you would like to start a discussion about this paper and are an AES member then you can login here:
Username:
Password:

If you are not yet an AES member and have something important to say about this paper then we urge you to join the AES today and make your voice heard. You can join online today by clicking here.

AES - Audio Engineering Society