AES Convention Papers Forum

Room Geometry Estimation from Higher-Order Ambisonics Signals using Convolutional Recurrent Neural Networks

Document Thumbnail

Knowledge of room geometry is a fundamental component for modeling acoustic environments. Since most common methods for room geometry estimation are based on prior knowledge, the generalization to unknown environments is somewhat limited. Deep learning based approaches have delivered promising results for the blind estimation of acoustic parameters considering mainly monaural signals. The purpose of this contribution is to investigate the effect of multichannel higher-order Ambisonics (HOA) signals on the performance of a convolutional recurrent neural network for blind room geometry estimation. Therefore a HOA-dataset of noisy speech signals in simulated rooms with realistic frequency-dependent reflection coefficients is introduced. Results show that for each additional Ambisonics order the estimation performance increases with the fourth-order model achieving a mean absolute error of 1.24 m averaged over all three room dimensions.

AES Convention: Paper Number:
Publication Date:

Click to purchase paper as a non-member or you can login as an AES member to see more options.

No AES members have commented on this paper yet.

Subscribe to this discussion

RSS Feed To be notified of new comments on this paper you can subscribe to this RSS feed. Forum users should login to see additional options.

Start a discussion!

If you would like to start a discussion about this paper and are an AES member then you can login here:

If you are not yet an AES member and have something important to say about this paper then we urge you to join the AES today and make your voice heard. You can join online today by clicking here.

AES - Audio Engineering Society