We propose a framework for matching a single PCM audio stream against multiple candidate audio streams. The framework allows real-time identification of the single stream with respect to the candidate streams. The identification is robust to signal delays of up to several seconds as well as to signal distortions caused by lossy coding, a noisy environment, or analog transmission. One area of application is the query-by-mobile-phone scenario, in which a user transmits an audio stream recorded from the radio, using a mobile phone as the recording device. The transmitted audio stream may then be identified with the proposed framework by matching it in real time against all possible radio programmes.
Authors:
Kurth, Frank; Scherzer, Roman
Affiliation:
Department of Computer Science III, University of Bonn, Bonn, Germany
AES Convention:
114 (March 2003)
Paper Number:
5821
Publication Date:
March 1, 2003
Subject:
Computer Audio and Networks