Finding structure and repetitions in a musical signal is crucial to enable interactive browsing into large databases of music files. Notably, it is useful to produce short summaries of musical pieces, or "audio thumbnails". In this paper, we propose an algorithm to find repeating patterns in an acoustic musical signal. We first segment the signal into a meaningful succession of timbres. This gives a reduced string representation of the music, the texture score, which doesn't encode any pitch information. We then look for patterns in this representation, using two techniques from image processing: Kernel Convolution and Hough Transform. The resulting patterns are relevant to musical structure, which shows that pitch is not the only useful representation for the structural analysis of polyphonic music.
Authors:
Aucouturier, Jean-Julien; Sandler, Mark
Affiliations:
Sony Computer Science Laboratory, Paris, France ; Dep. of Electronic Engineering, Queen Mary University of London, Mile End Road, London, UK(See document for exact affiliation information.)
AES Conference:
22nd International Conference: Virtual, Synthetic, and Entertainment Audio (June 2002)
Paper Number:
000204
Publication Date:
June 1, 2002
Subject:
Virtual, Synthetic and Entertainment Audio
Click to purchase paper as a non-member or you can login as an AES member to see more options.
No AES members have commented on this paper yet.
To be notified of new comments on this paper you can subscribe to this RSS feed. Forum users should login to see additional options.
If you are not yet an AES member and have something important to say about this paper then we urge you to join the AES today and make your voice heard. You can join online today by clicking here.