Our work presents a speech/music discrimination approach based on fuzzy rules for selecting the suitable coder required in an intelligent audio coding system. When the same coder is used for both speech and music, is difficult to achieve good audio quality and low bit rates for both types of signals. We propose using a simple feature, called Warped LPC-based Spectral Centroid (WLPC-SC) for speech/music discrimination. In order to select the suitable audio coder for each audio frame, an expert system is proposed. The main advantage of the proposed approach is the low computational cost in both the speech/music discrimination and coder selection stages. It allows its use in real time applications as internet audio streaming.
Authors:
Garcia Gálan, Sebastian; Muñoz-Exposito, Jose Enrique; Rivas Peña, Fernando; Ruiz-Reyes, Nicolas; Vera-Candeas, Pedro
Affiliation:
Universidad de Jaen
AES Convention:
120 (May 2006)
Paper Number:
6676
Publication Date:
May 1, 2006
Session Subject:
Analysis and Synthesis of Sound; Mobile Phone Audio; Automobile Audio
Click to purchase paper as a non-member or you can login as an AES member to see more options.
No AES members have commented on this paper yet.
To be notified of new comments on this paper you can subscribe to this RSS feed. Forum users should login to see additional options.
If you are not yet an AES member and have something important to say about this paper then we urge you to join the AES today and make your voice heard. You can join online today by clicking here.