The spatial speech reproduction capabilities of a KEMAR mouth simulator, a loudspeaker, a piston-on-a-sphere model, and a circular harmonic fitting are evaluated in the near field. The speech directivity of 24 human subjects, both male and female, is measured in the horizontal plane using a semicircular microphone array with a radius of 36.5 cm. Impulse responses are captured for the two devices, and filters are generated for the two numerical models to emulate their directional effect on speech reproduction. The four repeatable speech sources are compared with the recorded human speech both objectively, through directivity pattern and spectral magnitude differences, and subjectively, through a listening test on perceived coloration. Results show that the repeatable sources perform relatively well in terms of directivity, but irregularities in their directivity patterns introduce audible coloration in off-axis directions.
Gonzalez, Raimundo; McKenzie, Thomas; Politis, Archontis; Lokki, Tapio
Affiliations: Acoustics Lab, Department of Signal Processing & Acoustics, Aalto University, Espoo, Finland; Acoustics Lab, Department of Signal Processing & Acoustics, Aalto University, Espoo, Finland; Audio & Speech Processing Group, Tampere University of Technology, Tampere, Finland; Acoustics Lab, Department of Signal Processing & Acoustics, Aalto University, Espoo, Finland. (See document for exact affiliation information.)
JAES Volume 70 Issue 7/8 pp. 621-633; July 2022
Publication Date: July 19, 2022