This publication presents an overview and an evaluation of low-level features characterizing the noiselike or tonelike nature of an audio signal. Such features are widely used for content classification, segmentation, identification, coding of audio signals, blind source separation, speech enhancement and voice activity detection. Besides the widely used Spectral Flatness Measure various alternative descriptors exist. These features are reviewed and the requirements for these features are discussed. The features in scope are evaluated using synthetic signals and exemplarily real-world application related to audio content classification, namely voiced-unvoiced discrimination for speech signals and speech detection.
Author:
Uhle, Christian
Affiliation:
Fraunhofer Institute for Integrated Circuits IIS, Erlangen, Germany
AES Convention:
128 (May 2010)
Paper Number:
8035
Publication Date:
May 1, 2010
Subject:
Audio Processing—Analysis and Synthesis of Sound
Click to purchase paper as a non-member or you can login as an AES member to see more options.
No AES members have commented on this paper yet.
To be notified of new comments on this paper you can subscribe to this RSS feed. Forum users should login to see additional options.
If you are not yet an AES member and have something important to say about this paper then we urge you to join the AES today and make your voice heard. You can join online today by clicking here.