Hearing impaired (HI) listeners often struggle to follow conversations when exposed in a complex acoustic environment. This is partly due to the reduced ability in recovering the target speech Temporal Envelope (ENV) cues from Temporal Fine Structure (TFS). This study investigates the enhancement of speech intelligibility in HI listeners by processing the ENV of speech signals corrupted by real-world environmental noise. An Encoder-Decoder Long Short Term Memory (LSTM) model is exploited after perceptually motivated processing stages to compensate for the important ENV characteristics of comprehensible speech for hearing impairment. The computational model is evaluated using the Short-Time Objective Intelligibility (STOI) measure for speech intelligibility. Finally, results indicate a 6% improvement in the mean STOI measure across different SNR values.
Authors:
Thoidis, Iordanis; Vrysis, Lazaros; Pastiadis, Konstantinos; Markou, Konstantinos; Papanikolaou, George
Affiliation:
Aristotle University of Thessaloniki, Thessaloniki, Greece
AES Convention:
146 (March 2019)
Paper Number:
10206
Publication Date:
March 10, 2019
Subject:
Poster Session 4
Click to purchase paper as a non-member or you can login as an AES member to see more options.
No AES members have commented on this paper yet.
To be notified of new comments on this paper you can subscribe to this RSS feed. Forum users should login to see additional options.
If you are not yet an AES member and have something important to say about this paper then we urge you to join the AES today and make your voice heard. You can join online today by clicking here.