An audio processor that integrates intelligent classification and preprocessing algorithms is presented. Audio features in the time and frequency domains are extracted and processed prior to classification. Classification algorithms, including the nearest neighbor rule (NNR), artificial neural networks (ANN), fuzzy neural networks (FNN), and hidden Markov models (HMM), are used to classify and identify singers and musical instruments. A training phase is required to establish a feature space template, followed by a test phase in which the audio features of the test data are calculated and matched to the feature space template. In addition to audio classification, the proposed system provides several independent component analysis (ICA)-based preprocessing functions for blind source separation, voice removal, and noise reduction. The proposed techniques were applied to process various kinds of audio program materials. The test results reveal that the performance of the methods is satisfactory, but varies slightly with the algorithm and program materials used in the tests.
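As an illustration of the training and test phases described in the abstract, the following is a minimal sketch of a nearest neighbor rule (NNR) classifier over one time-domain and one frequency-domain feature. It is not the authors' implementation. The choice of features (zero-crossing rate and spectral centroid), the normalization step, and the function names `extract_features`, `train`, and `classify` are all assumptions made for the example.

```python
import numpy as np

def extract_features(signal, sample_rate):
    """Return [zero-crossing rate, spectral centroid in Hz] for a mono signal.

    Both features are assumed for illustration; the abstract does not list
    the features actually used.
    """
    signal = np.asarray(signal, dtype=float)
    # Time domain: fraction of adjacent sample pairs whose sign differs.
    signs = np.signbit(signal)
    zcr = np.mean(signs[1:] != signs[:-1])
    # Frequency domain: magnitude-weighted mean frequency of the spectrum.
    magnitude = np.abs(np.fft.rfft(signal))
    freqs = np.fft.rfftfreq(len(signal), d=1.0 / sample_rate)
    centroid = np.sum(freqs * magnitude) / (np.sum(magnitude) + 1e-12)
    return np.array([zcr, centroid])

def train(signals, labels, sample_rate):
    """Training phase: build a feature space template from labeled signals."""
    feats = np.array([extract_features(s, sample_rate) for s in signals])
    # Normalize each feature so that neither dominates the distance (assumption).
    mean = feats.mean(axis=0)
    std = feats.std(axis=0) + 1e-12
    return {
        "features": (feats - mean) / std,
        "labels": list(labels),
        "mean": mean,
        "std": std,
    }

def classify(template, signal, sample_rate):
    """Test phase: return the label of the closest template entry."""
    feat = (extract_features(signal, sample_rate) - template["mean"]) / template["std"]
    distances = np.linalg.norm(template["features"] - feat, axis=1)
    return template["labels"][int(np.argmin(distances))]
```

In this sketch, `train` would be called once on labeled recordings, and `classify` would then be called on each unlabeled test signal sampled at the same rate.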
Authors:
Bai, Mingsian R.; Chen, Meng-chun
Affiliation:
Department of Mechanical Engineering, National Chiao-Tung University, Hsin-Chu, Taiwan
JAES Volume 55 Issue 5 pp. 372-384; May 2007
Publication Date:
May 15, 2007