It is often desired to detect some particular short sound events from an audio recording. For example, in music analysis and processing, one may be interested in detection of percussive events. In environmental audio analysis one may look for individual sound events related to some activity, for example, sounds of footsteps from a walking person. Generally these problems can be solved by matching some prototype time-frequency (TF) patterns to a TF representation of the input signals to obtain time-varying probability functions for the prototype events. The method introduced in this paper is based on a small number of locally collected event patterns that are used directly to dene features for weighted cascade of weak classiffiers that is trained using the AdaBoost algorithm. The results of a comparison to a traditional sound event classier based on the mel-frequency cepstrum coecients and a hidden Markov model are very encouraging.
Author:
Härmä, Aki
Affiliation:
Philips Research Europe, Eindhoven, The Netherlands
AES Conference:
45th International Conference: Applications of Time-Frequency Processing in Audio (March 2012)
Paper Number:
1-5
Publication Date:
March 1, 2012
Subject:
Processing of Audio
Click to purchase paper as a non-member or you can login as an AES member to see more options.
No AES members have commented on this paper yet.
To be notified of new comments on this paper you can subscribe to this RSS feed. Forum users should login to see additional options.
If you are not yet an AES member and have something important to say about this paper then we urge you to join the AES today and make your voice heard. You can join online today by clicking here.