
AES Convention Papers Forum

Synthesising Prosody with Variable Resolution


This paper presents a technique for synthesising prosody based upon information extracted from spoken utterances. We are interested in designing systems that learn how to speak autonomously by interacting with humans. Our motivation for an in-depth investigation of prosody stems from the fact that infants appear to listen acutely to prosody during the first months of life, and we presume that any system aimed at learning some form of speaking skill should display this fundamental capacity. This paper addresses two fundamental components for the development of such systems: prosody listening and prosody production. It begins with a brief introduction to the problem within the context of our research objectives, then introduces the system and presents some commented examples. The paper concludes with final remarks and a brief discussion of future developments.
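The abstract does not detail how prosodic information is extracted, so the following Python sketch is only a loose, hypothetical illustration of the "prosody listening" component rather than the paper's own method. It assumes the librosa library, an illustrative file name "utterance.wav", and an arbitrary hop length, and pulls two common prosodic contours from a recording: fundamental frequency (pitch) and RMS intensity.

```python
# Hypothetical sketch of a "prosody listening" step: extract a pitch (F0)
# contour and an intensity contour from a spoken utterance.
# This is not the method described in the paper; it only illustrates the kind
# of information such a system might extract from speech.

import librosa
import numpy as np

def extract_prosody(path, hop_length=256):
    """Return frame times, F0 contour (Hz, NaN when unvoiced), and RMS intensity."""
    y, sr = librosa.load(path, sr=None)  # keep the file's native sample rate

    # Fundamental frequency via probabilistic YIN; unvoiced frames come back as NaN.
    f0, voiced_flag, voiced_prob = librosa.pyin(
        y,
        fmin=librosa.note_to_hz("C2"),  # ~65 Hz, low end of adult speech
        fmax=librosa.note_to_hz("C6"),  # ~1047 Hz, generous upper bound
        sr=sr,
        hop_length=hop_length,
    )

    # RMS energy as a rough intensity contour on the same frame grid.
    rms = librosa.feature.rms(y=y, hop_length=hop_length)[0]

    times = librosa.times_like(f0, sr=sr, hop_length=hop_length)
    return times, f0, rms

if __name__ == "__main__":
    times, f0, rms = extract_prosody("utterance.wav")  # illustrative file name
    voiced = ~np.isnan(f0)
    print(f"{voiced.sum()} voiced frames, mean F0 = {np.nanmean(f0):.1f} Hz")
```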

Author:
Affiliation:
AES Convention:
Paper Number:
Publication Date:
Subject:


No AES members have commented on this paper yet.

Subscribe to this discussion

To be notified of new comments on this paper, you can subscribe to this RSS feed. Forum users should log in to see additional options.

Start a discussion!

If you would like to start a discussion about this paper and are an AES member, you can log in here.

If you are not yet an AES member and have something important to say about this paper, we urge you to join the AES today and make your voice heard. You can join online by clicking here.
