A musical data set for note-level segmentation of monophonic music is presented. It contains 36 excerpts from commercial recordings of monophonic classical western music and features the instrument groups strings, woodwind and brass. The excerpts are self-contained phrases with a mean length of 17.97 seconds and an average of 20 notes. All phrases are played in moderate tempo, mostly with significant amounts of expressive articulation. A manually annotated ground truth splits each item into a sequence of the three states note, transition and rest. The set is designed as an open source project, aiming at the development and evaluation of algorithms for segmentation, music performance analysis and feature selection. This paper presents the process of ground truth labeling and a detailed description of the data set and its properties.
Authors:
von Coler, Henrik; Lerch, Alexander
Affiliations:
Georgia Tech Center for Music Technology, Atlanta, GA, USA; SIM - Staatliches Institut für Musikforschung Preußischer Kulturbesitz, Berlin, Germany(See document for exact affiliation information.)
AES Conference:
53rd International Conference: Semantic Audio (January 2014)
Paper Number:
P2-3
Publication Date:
January 27, 2014
Subject:
Automatic Music Transcription
Click to purchase paper as a non-member or you can login as an AES member to see more options.
No AES members have commented on this paper yet.
To be notified of new comments on this paper you can subscribe to this RSS feed. Forum users should login to see additional options.
If you are not yet an AES member and have something important to say about this paper then we urge you to join the AES today and make your voice heard. You can join online today by clicking here.