This paper proposes a Gaussian mixture model (GMM)-based music discrimination system for mobile broadcasting receivers. The objective of the system is to automatically archive music signals from audio broadcasting programs, in which music is normally mixed with human voices, acoustic noise, commercial advertisements, and so on. To make the system more robust and to cut the recording precisely at its starting and ending points, we also introduce a post-processing module whose features consist of signal duration, energy dynamics, and local variation of feature statistics. Experimental results on various input signals verify the superiority of the proposed system.
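As a rough illustration of the general idea only, and not the authors' implementation, the sketch below scores each feature frame under two diagonal-covariance GMMs (music and non-music), labels the frame by the higher log-likelihood, and then discards music segments that are too short. The function names, the toy model parameters, and the minimum-duration rule are all assumptions made for this example. The abstract lists signal duration as one of several post-processing features but does not say how it is used.

```python
import math


def gmm_log_likelihood(x, weights, means, variances):
    """Log-likelihood of feature vector x under a diagonal-covariance GMM."""
    log_terms = []
    for w, mu, var in zip(weights, means, variances):
        log_p = math.log(w)
        for xi, mi, vi in zip(x, mu, var):
            log_p += -0.5 * (math.log(2 * math.pi * vi) + (xi - mi) ** 2 / vi)
        log_terms.append(log_p)
    # Log-sum-exp over the mixture components for numerical stability.
    m = max(log_terms)
    return m + math.log(sum(math.exp(t - m) for t in log_terms))


def classify_frames(frames, music_gmm, other_gmm):
    """Label a frame True (music) when the music GMM scores it higher."""
    return [
        gmm_log_likelihood(f, *music_gmm) > gmm_log_likelihood(f, *other_gmm)
        for f in frames
    ]


def drop_short_segments(labels, min_frames):
    """Hypothetical duration rule: relabel music runs shorter than min_frames."""
    out = list(labels)
    i = 0
    while i < len(out):
        if out[i]:
            j = i
            while j < len(out) and out[j]:
                j += 1
            if j - i < min_frames:
                out[i:j] = [False] * (j - i)
            i = j
        else:
            i += 1
    return out


# Toy one-dimensional models: (weights, means, variances). Assumed values.
music_gmm = ([1.0], [[0.0]], [[1.0]])
other_gmm = ([1.0], [[5.0]], [[1.0]])

frames = [[0.1], [4.9], [0.2], [0.0], [0.3], [5.1]]
raw = classify_frames(frames, music_gmm, other_gmm)
smoothed = drop_short_segments(raw, min_frames=2)
```

Here the isolated first frame is classified as music but removed by the duration rule, while the three-frame run is kept. In practice the GMM parameters would be trained on labelled audio features rather than set by hand.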
Authors:
Kang, Hong; Kang, Hyun; Song, Myung
Affiliations:
Yonsei University; Kangnam University (See document for exact affiliation information.)
AES Conference:
34th International Conference: New Trends in Audio for Mobile and Handheld Devices (August 2008)
Paper Number:
17
Publication Date:
August 1, 2008
Subject:
Audio for Mobile & Handheld Devices: Signal Processing