Automatic identification of audio titles in radio broadcasts is a first step towards automatic annotation of radio programmes. Systems designed for identification have to cope with the variety of post-processing that radio stations may apply to audio material. One of the more difficult manipulations to handle is time-scaling, i.e., variation of the playback speed. In this paper we propose a robust fingerprinting technique designed for the identification of time-scaled audio data. To allow for fast, time-scale-invariant audio identification, the extracted fingerprints are used as input to an algebraic indexing technique that has already been applied successfully to the task of audio identification.
Author: Bardeli, Rolf
Affiliation: Department of Computer Science III, University of Bonn, Bonn, Germany
AES Conference: 25th International Conference: Metadata for Audio (June 2004)
Paper Number: 4-2
Publication Date: June 1, 2004
Subject: Metadata for Audio