In This Section
Perceptual Effects of Dynamic Range Compression in Popular Music Recordings - January 2014
Accurate Calculation of Radiation and Diffraction from Loudspeaker Enclosures at Low Frequency - June 2013
New Measurement Techniques for Portable Listening Devices: Technical Report - October 2013
AES Convention Papers Forum
A Voice Classification System for Younger Children with Applications to Content Navigation
A speech classification system is proposed which has applications for accessibility of content for younger children. To allow a young child to access online content (where typical interfaces such as search engines or hierarchical navigation would be inappropriate) we propose a voice classification system trained to recognise a range of sounds and vocabulary typical of younger children. As an example we design a system for classifying animal noises. Acoustic features are extracted from a corpus of animal noises made by a class of young children. A Support Vector Machine is trained to classify the sounds into one of 12 corresponding animals. We investigate the precision and recall of the classifier for various classification parameters. We investigate an appropriate choice of features to extract from the audio and compare the performance when using mean Mel-frequency Cepstral Coefficients (MFCC), a single-Gaussian model fitted to the MFCCs as well as a range of temporal features. To investigate the real-world applicability of the system we pay particular attention to the difference between training a generic classifier from a collected corpus of examples and one trained to a particular voice.
No AES members have commented on this paper yet.
Subscribe to this discussion
Start a discussion!
If you are not yet an AES member and have something important to say about this paper then we urge you to join the AES today and make your voice heard. You can join online today by clicking here.