The objective of this paper is to determine which of the MPEG-7 standard low-level sound descriptors are the most significant in the process of automatic classification of musical instrument sounds. First, pitch detection is performed. Then, the parametrization stage of musical sounds based on descriptors contained in the MPEG-7 standard is carried out. Next, a thorough statistical analysis of the feature vectors obtained is performed. For the purpose of automatic classification, two decision systems based on artificial neural networks (ANNs) and rough sets, are used. Both decision systems are trained with feature vectors consisted mostly of parameters contained in the MPEG-7 standard, however their content being reduced after statistical analyses. In addition, a comparison of results obtained by these decision systems with the results got from the nearest neighbor algorithm is made.
Authors:
Szczuko, Piotr; Dalka, Piotr; Dabrowski, Marcin; Kostek, Bozena
Affiliation:
Gdansk University of Technology, Faculty of Electronics, Telecommunications and Informatics, Multimedia Systems Department, Gdansk, Poland
AES Convention:
116 (May 2004)
Paper Number:
6105
Publication Date:
May 1, 2004
Session Subject:
Room and Architectural Acoustics; Musical Acoustics
Click to purchase paper as a non-member or you can login as an AES member to see more options.
No AES members have commented on this paper yet.
To be notified of new comments on this paper you can subscribe to this RSS feed. Forum users should login to see additional options.
If you are not yet an AES member and have something important to say about this paper then we urge you to join the AES today and make your voice heard. You can join online today by clicking here.