We describe an efficient system, which directly extracts features from compressed audio material. It consists of a time/frequency conversion method and a feature extraction algorithm. The conversion method provides the feature extraction algorithm with a suitable complex spectral representation directly from the compressed domain. It further allows to trade-off between computational complexity and conversion accuracy. Several operating points using different conversion accuracies were tested with an MPEG audio identification system in order to evaluate the identification confidence. Based on these results it is possible to reduce the computational complexity from O(N log N) to O(N) compared to the conventional approach (complete decoding followed by a frequency analysis).
Authors:
Friedrich, Tobias; Gruhne, Matthias; Schuller, Gerald
Affiliation:
Fraunhofer IDMT
AES Convention:
124 (May 2008)
Paper Number:
7459
Publication Date:
May 1, 2008
Subject:
Audio Archiving, Storage, Restoration, and Content Management
Click to purchase paper as a non-member or you can login as an AES member to see more options.
No AES members have commented on this paper yet.
To be notified of new comments on this paper you can subscribe to this RSS feed. Forum users should login to see additional options.
If you are not yet an AES member and have something important to say about this paper then we urge you to join the AES today and make your voice heard. You can join online today by clicking here.