The purpose of this study was to apply Self-Organizing Maps to differentiate between the correct and the incorrect allophone pronunciations and to compare the results with subjective evaluation. Recordings of a list of target words, containing selected allophones of English plosive consonants, the velar nasal and the lateral consonant, were made twice. First, the target words were read from the list by nine non-native speakers and then repeated after a phonology expert’s recorded sample. Afterwards, two recorded signal sets were segmented into allophones and parameterized. For that purpose, a set of descriptors, commonly employed in music information retrieval, was utilized to determine whether they are effective in allophone analysis. The phonology expert’s task was to evaluate the pronunciation accuracy of each uttered allophone. Extracted feature vectors along with the assigned ratings were applied to SOMs.
Kostek, Bozena; Piotrowska, Magdalena; Ciszewski, Tomasz; Czyzewski, Andrzej
Affiliations: Gdansk University of Technology, Gdansk, Poland; Audio Acoustics Lab.; University of Gdansk, Gdansk, Poland(See document for exact affiliation information.)
AES Convention: 143 (October 2017) Paper Number: 9847
Publication Date: October 8, 2017
Subject: Signal Processing
No AES members have commented on this paper yet.
If you are not yet an AES member and have something important to say about this paper then we urge you to join the AES today and make your voice heard. You can join online today by clicking here.