Evaluating the quality of voice recordings invokes two different aspects of preference: the appeal of the speaker’s voice characteristics and the perceived degradation of the actual recording. Subjective experiments where no reference ideal is provided, called absolute category ratings, fuse together both components. This research investigates if voice postprocessing (timbre optimization, loudness compensations, de-essing, room reverberation, and noise suppression) can improve perceived quality. The results show that none of these processing methods significantly improve the perceived quality and mostly give rise to a quality degradation. The voice preference is a very significant factor in the perceived quality. The subjective results are used to validate ITU-T Recommendation P.863 (POLQA) in the high quality region.
Authors:
Beerends, John; Beerends, Imre
Affiliations:
TNO, The Hague, The Netherlands; Mantis Audio, Wateringen, The Netherlands(See document for exact affiliation information.)
JAES Volume 63 Issue 3 pp. 174-183; March 2015
Publication Date:
March 16, 2015
Click to purchase paper as a non-member or you can login as an AES member to see more options.
No AES members have commented on this report yet.
To be notified of new comments on this report you can subscribe to this RSS feed. Forum users should login to see additional options.
If you are not yet an AES member and have something important to say about this report then we urge you to join the AES today and make your voice heard. You can join online today by clicking here.