One of the biggest challenges still encountered in speech communication via a mobile phone is that it is sometimes very difficult to understand what is said when listening in a noisy place. In this paper a novel approach based on two models is introduced to increase speech intelligibility for a listener surrounded by environmental noise. One model perceptually optimizes the speech in the presence of simultaneous background noise; the other modifies the speech towards a more intelligible, naturally elicited speaking style. The two models are combined to provide more understandable speech in a loud, noisy environment, even when the speech volume cannot be increased. The improvements in perceptual quality and intelligibility are shown by Perceptual Objective Listening Quality Assessment and Listening Effort Mean Opinion Score evaluation.
Authors:
Choo, Kihyun; Porov, Anton; Koutsogiannaki, Maria; Francois, Holly; Jeong, Jonghoon; Sung, Hosang; Oh, Eunmi
Affiliations:
Samsung Electronics Co., Ltd., Suwon, Korea; Samsung R&D Institute Russia, Moscow, Russia; Samsung R&D Institute UK; Samsung Electronics R&D Institute UK, Staines-Upon Thames, Surrey, UK; Samsung Electronics Co. Ltd., Seoul, Korea; Samsung Electronics, Korea; Samsung Electronics Co., Ltd., Seoul, Korea (See document for exact affiliation information.)
AES Convention:
143 (October 2017)
Paper Number:
9810
Publication Date:
October 8, 2017
Subject:
Signal Processing