This paper is related to three-dimensional (3D) audio signal processing for virtual spatial audio conference applications. The core idea is to provide users of a virtual spatial audio conference with recommendations for the optimal spatial layout (spatial arrangement) of participants where optimal implies maximal speech intelligibility. That is, the listener’s ability to understand speech from an individual speaker in a multi-speaker scenario is enhanced. The idea combines information of the individual’s voice together with directional audio information to estimate the speech intelligibility of candidate spatial layouts (spatial arrangement). The candidate layout that provides the best speech intelligibility estimate is then selected.
Authors:
Pang, Liyun; Hoffmann, Pablo
Affiliation:
Huawei Technologies Duesseldorf GmbH, Düsseldorf, Germany
AES Convention:
140 (May 2016)
Paper Number:
9573
Publication Date:
May 26, 2016
Subject:
Rendering Systems
Click to purchase paper as a non-member or you can login as an AES member to see more options.
No AES members have commented on this paper yet.
To be notified of new comments on this paper you can subscribe to this RSS feed. Forum users should login to see additional options.
If you are not yet an AES member and have something important to say about this paper then we urge you to join the AES today and make your voice heard. You can join online today by clicking here.