Audio classification methods work well when fine-tuned to reduced domains, such as musical instrument classification or simplified sound effects taxonomies. Classification methods cannot currently offer the detail needed in general sound recognition. A real-world-sound recognition tool would require thousands of classifiers, each specialized in distinguishing little details and a taxonomy that represents the real world. We describe the use of WordNet, a semantic network that organizes real world knowledge as the taxonomy backbone. In order to overcome the huge number of classifiers to distinguish an ever growing number of sounds, the recognition engine uses nearest-neighbor classifier with a database of isolated sounds unambiguously linked to WordNet concepts.
Authors:
Cano, Pedro; Koppenberger, Markus; Herrera, Perfecto; Le Groux, Sylvain; Ricard, Julien; Wack, Nicolas
Affiliation:
Institut de l'Audiovisual, Universitat Pompu Fabra, Barcelona, Spain
AES Convention:
116 (May 2004)
Paper Number:
6139
Publication Date:
May 1, 2004
Session Subject:
Audio Recording and Reproduction; Archiving and Content Management
Click to purchase paper as a non-member or you can login as an AES member to see more options.
No AES members have commented on this paper yet.
To be notified of new comments on this paper you can subscribe to this RSS feed. Forum users should login to see additional options.
If you are not yet an AES member and have something important to say about this paper then we urge you to join the AES today and make your voice heard. You can join online today by clicking here.