This paper analyzes and compares different methods for audio chroma feature extraction. The chroma feature is a descriptor, which represents the tonal content of a musical audio signal in a condensed form. Therefore chroma features can be considered as important prerequisite for high-level semantic analysis, like chord recognition or harmonic similarity estimation. A better quality of the extracted chroma feature enables much better results in these high-level tasks. In order to discover the quality of chroma features, seven different state-of-the-art chroma feature extraction methods have been implemented. Based on an audio database, containing 55 variations of triads, the output of these algorithms is critically evaluated. The best results were obtained with the Enhanced Pitch Class Profile.
Authors:
Stein, Michael; Schubert, Benjamin M.; Gruhne, Matthias; Gatzsche, Gabriel; Mehnert, Markus
Affiliations:
Ilmenau University of Technology, Ilmenau, Germany; Fraunhofer Institute for Digital Media Technology, Ilmenau, Germany(See document for exact affiliation information.)
AES Convention:
126 (May 2009)
Paper Number:
7814
Publication Date:
May 1, 2009
Subject:
Signal Analysis, Measurements, Restoration
Click to purchase paper as a non-member or you can login as an AES member to see more options.
No AES members have commented on this paper yet.
To be notified of new comments on this paper you can
subscribe to this RSS feed.
Forum users should login to see additional options.
If you are not yet an AES member and have something important to say about this paper then we urge you to join the AES today and make your voice heard. You can join online today by clicking here.