In this and the companion paper Part I, the authors present the Perceptual Objective Listening Quality Assessment (POLQA), the third-generation speech quality measurement algorithm, standardized by the International Telecommunication Union in 2011 as Recommendation P.863. This paper describes the newly developed perceptual model of this standard, allowing to assess speech quality over a wide range of distortions, from “High Definition” super-wideband speech (HD Voice, audio bandwidth up to 14 kHz) to extremely distorted narrowband telephony speech (audio bandwidth down to 2 kHz), using sample rates between 48 and 8 kHz. POLQA is suited for distortions that are outside the scope of PESQ, such as linear frequency response distortions, super-wideband degradations, time stretching/compression as found in Voice-over-IP, certain types of codec distortions, reverberations, and the impact of playback volume. Part II outlines the core elements of the underlying perceptual model and presents the final results.
Beerends, John G.; Schmidmer, Christian; Berger, Jens; Obermann, Matthias; Ullmann, Raphael; Pomy, Joachim; Keyhl, Michael
Affiliations: TNO, Delft, The Netherlands; OPTICOM GmbH, Erlangen, Germany; SwissQual AG, Zuchwil, Switzerland(See document for exact affiliation information.)
JAES Volume 61 Issue 6 pp. 385-402; June 2013
Publication Date: July 8, 2013
Download Now (662 KB)
No AES members have commented on this paper yet.
If you are not yet an AES member and have something important to say about this paper then we urge you to join the AES today and make your voice heard. You can join online today by clicking here.