The ability to generate appropriate auditory localization cues is an important requisite of spatial audio rendering technology that contributes to the plausibility of virtual sounds presented to a user, especially in XR applications (VR/AR/MR). Algorithmic approaches have been proposed to quantify such technologies’ ability to reproduce interaural level difference (ILD) cues through regression and statistical methods, providing a useful standardization and automation method to estimate the localization accuracy potential of a given spatial audio rendering engine. Previous approaches are extended to include interaural time difference (ITD) cues as part the perceptual transform through the use of the interaural transfer function (ITF). The extended algorithmic approach of quantifying localization accuracy may provide an adequate substitute for critical listening studies as an evaluation method. However, this approach has not yet been validated through comparison with localization listening studies. A review of listening tests are reviewed in conclusion to increase confidence in presented methods of algorithmically quantifying localization accuracy potential of a spatial audio rendering engine.
Mathew, Justin D; Januszkiewicz, Lukasz; Pensko, Maria; Hertensteiner, Mark; Audfray, Rémi
Affiliations: Magic Leap; SoftServe Inc.; Facebook Reality Labs(See document for exact affiliation information.)
AES Convention: 151 (October 2021) eBrief:648
Publication Date: October 13, 2021
Subject: Multichannel and spatial audio processing and applications
No AES members have commented on this paper yet.
If you are not yet an AES member and have something important to say about this paper then we urge you to join the AES today and make your voice heard. You can join online today by clicking here.