Content-based Music Information Retrieval includes tasks that involve estimating the pitched content of music, such as the main melody or the bass line. To date, the field lacks a good machine representation that models the human perception of pitch, with each task using specific, tailored representations. This paper proposes factoring pitch estimation problems into two stages, where the output of the first stage for all tasks is a multipitch contour representation. Further, we propose the adoption of \emph{pitch contours} as a unit of pitch organization. We give a review of the existing work on contour extraction and characterization, and present experiments that demonstrate the discriminability of pitch contours. We conclude with directions for future research.
Authors:
Bittner, Rachel M.; Salamon, Justin; Bosch, Juan J.; Bello, Juan P.
Affiliations:
New York University, New York, NY, USA; Universitat Pompeu Fabra, Barcelona, Spain(See document for exact affiliation information.)
AES Conference:
2017 AES International Conference on Semantic Audio (June 2017)
Paper Number:
3-1
Publication Date:
June 13, 2017
Subject:
Pitch Tracking
Click to purchase paper as a non-member or you can login as an AES member to see more options.
No AES members have commented on this paper yet.
To be notified of new comments on this paper you can
subscribe to this RSS feed.
Forum users should login to see additional options.
If you are not yet an AES member and have something important to say about this paper then we urge you to join the AES today and make your voice heard. You can join online today by clicking here.