Knowledge of room geometry is a fundamental component for modeling acoustic environments. Since most common methods for room geometry estimation are based on prior knowledge, the generalization to unknown environments is somewhat limited. Deep learning based approaches have delivered promising results for the blind estimation of acoustic parameters considering mainly monaural signals. The purpose of this contribution is to investigate the effect of multichannel higher-order Ambisonics (HOA) signals on the performance of a convolutional recurrent neural network for blind room geometry estimation. Therefore a HOA-dataset of noisy speech signals in simulated rooms with realistic frequency-dependent reflection coefficients is introduced. Results show that for each additional Ambisonics order the estimation performance increases with the fourth-order model achieving a mean absolute error of 1.24 m averaged over all three room dimensions.
Poschadel, Nils; Hupke, Robert; Preihs, Stephan; Peissig, Jürgen
Affiliation: Leibniz University Hannover, Institute of Communications Technology, Hannover, Germany
AES Convention: 150 (May 2021) Paper Number: 10482
Publication Date: May 24, 2021
No AES members have commented on this paper yet.
If you are not yet an AES member and have something important to say about this paper then we urge you to join the AES today and make your voice heard. You can join online today by clicking here.