The spread of large music collections has created a need for algorithms that quickly retrieve songs from query audio excerpts. Online media sharing platforms, for example, may want to detect copyrighted material. In this paper we start from a state-of-the-art algorithm for robust music matching that compares spectrograms using computer vision concepts. We show that this algorithm can be further optimized by exploiting more recent image processing techniques and by restricting the analysis to limited temporal windows, while still achieving accurate matching performance. The proposed solution is validated on a dataset of 800 songs, showing an 80% decrease in computational complexity at the cost of an accuracy loss of only about 1%.
Authors: Zanoni, Massimiliano; Lusardi, Stefano; Bestagini, Paolo; Canclini, Antonio; Sarti, Augusto; Tubaro, Stefano
Affiliation: Politecnico di Milano, Milan, Italy
AES Convention: 142 (May 2017)
Paper Number: 9763
Publication Date: May 11, 2017
Subject: Audio Analysis and Synthesis