The research interest in technologies for supporting people in their own homes is constantly increasing. In this context this paper proposes a speech-interfaced system for recognizing home automation commands and distress calls. The robustness of the system is increased by employing Power Normalized Cepstral Coefficients as features and by using an adaptive algorithm to reduce known sources of interference. In addition, the mismatch introduced by vocal effort variability is reduced employing a vocal effort classifier and multiple acoustic models. The performance has been evaluated on ITAAL, a recently proposed corpus of home automation commands and distress calls in Italian. The results confirm that the adopted solutions are effective to be employed in a distorted acoustic scenario.
Authors:
Principi, Emanuele; Bonfigli, Roberto; Squartini, Stefano; Piazza, Francesco
Affiliation:
Università Politecnica delle Marche, Ancona, Italy
AES Convention:
136 (April 2014)
Paper Number:
9089
Publication Date:
April 25, 2014
Subject:
Applications in Audio/Education/Forensics
Click to purchase paper as a non-member or you can login as an AES member to see more options.
No AES members have commented on this paper yet.
To be notified of new comments on this paper you can subscribe to this RSS feed. Forum users should login to see additional options.
If you are not yet an AES member and have something important to say about this paper then we urge you to join the AES today and make your voice heard. You can join online today by clicking here.