Parametric spatial audio rendering promises fast and perceptually convincing audio cues that remain playback-system agnostic and enable aesthetic modifications of the acoustic experience within games and virtual reality. We propose a parametric encoder for spatial room impulse responses that is tested with nine simulated rooms spanning a large range of sizes and reverberation times. A key component of the pipeline is a perceptually inspired model for determining a minimal set of salient early reflections to reduce computational complexity. The results of a listening study with 27 subjects suggest that rendering six early reflections is indiscernible from a fully-rendered reference for the tested speech content and frequency-independent room simulations based on the image source method. However, the proposed model requires further improvements with respect to detecting and selecting the most-salient early reflections.
Brinkmann, Fabian; Gamper, Hannes; Raghuvanshi, Nikunj; Tashev, Ivan
Affiliations: Audio Communication Group, TU Berlin; Microsoft Research Redmond, WA, USA; Microsoft Research Redmond, WA, USA; Microsoft Research Redmond, WA, USA(See document for exact affiliation information.)
AES Convention: 148 (May 2020) Paper Number: 10380
Publication Date: May 28, 2020
Subject: Room Acoustics
No AES members have commented on this paper yet.
If you are not yet an AES member and have something important to say about this paper then we urge you to join the AES today and make your voice heard. You can join online today by clicking here.