AES Convention Papers Forum

Creation of New Virtual Patterns for Emotion Recognition through PSOLA

Human emotions can be recognized through speech analysis. A major problem in this field is the lack of databases with enough patterns for proper training, which makes it harder for the learned model to generalize. One possible solution is to enlarge the training set by creating new virtual patterns. To do so, we modify the average pitch using Pitch Synchronous Overlap and Add (PSOLA) combined with resampling, which changes the average pitch without altering either the pitch variations or the speech rate. The emotion in the utterance is therefore preserved. Results on the original test set show that creating new virtual training patterns in this way can significantly reduce these generalization problems.
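The abstract does not spell out how PSOLA and resampling are chained, so the following is only a minimal sketch of one common arrangement, not the authors' implementation. It assumes that pitch marks are already available, stretches the signal with a simple time-domain PSOLA so that its duration changes while the pitch period stays the same, and then resamples back to the original length so that the pitch is scaled and the duration is restored. The function names, the `ratio` parameter, the linear-interpolation resampler, and the synthetic test tone are all illustrative assumptions.

```python
import numpy as np

def psola_time_stretch(x, marks, factor):
    """Change duration by `factor` while keeping the pitch period (TD-PSOLA sketch).

    Assumes `marks` holds at least three increasing pitch-mark sample indices.
    """
    marks = np.asarray(marks, dtype=int)
    out = np.zeros(int(round(len(x) * factor)))
    t_s = float(marks[1]) * factor          # position of the first synthesis mark
    while True:
        # Analysis mark closest to the time-warped synthesis position.
        i = int(np.argmin(np.abs(marks - t_s / factor)))
        i = min(max(i, 1), len(marks) - 2)  # keep one neighbour on each side
        p = int(min(marks[i] - marks[i - 1], marks[i + 1] - marks[i]))
        c = int(round(t_s))
        if c + p >= len(out):
            break
        if c - p >= 0:
            # Two-period Hann-windowed grain, overlap-added at the synthesis mark.
            grain = x[marks[i] - p : marks[i] + p + 1] * np.hanning(2 * p + 1)
            out[c - p : c + p + 1] += grain
        t_s += p                            # synthesis marks keep the original period
    return out

def shift_average_pitch(x, marks, ratio):
    """Scale pitch by `ratio` at constant duration: PSOLA stretch, then resample."""
    stretched = psola_time_stretch(x, marks, ratio)
    # Read the stretched signal `ratio` times faster (linear interpolation),
    # which restores the original length and multiplies the pitch by `ratio`.
    positions = np.arange(len(x)) * ratio
    return np.interp(positions, np.arange(len(stretched)), stretched)

def dominant_freq(sig, fs):
    """Frequency in Hz of the largest non-DC spectral peak."""
    spec = np.abs(np.fft.rfft(sig))
    spec[0] = 0.0
    return float(np.argmax(spec)) * fs / len(sig)

# Hypothetical test tone: 100 Hz fundamental plus one harmonic, 1 s at 8 kHz.
fs = 8000
t = np.arange(fs) / fs
x = np.sin(2 * np.pi * 100 * t) + 0.5 * np.sin(2 * np.pi * 200 * t)
marks = np.arange(0, len(x), 80)            # one mark per 80-sample period
y = shift_average_pitch(x, marks, 1.25)
# Same length as the input, with the fundamental expected near 125 Hz.
print(len(y), dominant_freq(y, fs))
```

Because the whole pitch contour is multiplied by a single ratio, its relative variations are kept and only the average moves, which matches the property the abstract relies on. Note that the resampling step also shifts the spectral envelope by the same ratio, so a sketch like this is only plausible for small pitch changes.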

