MPEG-H 3D Audio is the current standard for compressing higher-order ambisonics (HOA) data. It uses singular value decomposition (SVD) to spatially decorrelate the HOA data, followed by the modified discrete cosine transform (MDCT) for temporal decorrelation. Prominent and ambient sound components are then encoded separately (e.g., using the standard core audio codec) and sent to the decoder. Earlier work achieved significant improvements in bitrate and audio quality over MPEG-H by applying the SVD in the frequency domain rather than the ambisonics domain. In this work, we provide additional compression gains by adaptively calculating and extending the set of SVD basis vectors, at a negligible increase in side information cost, using information obtained from the previously reconstructed frame. Objective and subjective results provide evidence of higher compression gains compared to existing methods.
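The spatial decorrelation step mentioned in the abstract can be illustrated with a minimal sketch. It is a generic SVD split of a single higher-order ambisonics (HOA) frame into a few dominant components and a residual. It is not the MPEG-H procedure and not the adaptive basis extension proposed in the paper. The function name `svd_split`, the frame size, the synthetic two-source test signal, and the choice `k=2` are all assumptions made for illustration.

```python
import numpy as np

def svd_split(frame, k):
    """Split one HOA frame (channels x samples) into k dominant
    components and a residual. Hypothetical helper, for illustration only."""
    # Thin SVD: frame = U @ diag(s) @ Vt
    U, s, Vt = np.linalg.svd(frame, full_matrices=False)
    basis = U[:, :k]                   # k spatial basis vectors (channels x k)
    signals = basis.T @ frame          # k mutually uncorrelated signals (k x samples)
    ambient = frame - basis @ signals  # residual left after removing them
    return basis, signals, ambient

# Synthetic frame (assumed sizes): third-order HOA has (3 + 1)^2 = 16 channels.
rng = np.random.default_rng(0)
channels, samples = 16, 1024
sources = rng.standard_normal((2, samples))    # two made-up sound sources
mixing = rng.standard_normal((channels, 2))    # made-up spatial mixing matrix
frame = mixing @ sources + 0.01 * rng.standard_normal((channels, samples))

basis, signals, ambient = svd_split(frame, k=2)

# The dominant signals are mutually orthogonal, and the residual is small
# because the synthetic frame is close to rank 2.
gram = signals @ signals.T
off_diagonal = gram - np.diag(np.diag(gram))
residual_ratio = np.linalg.norm(ambient) / np.linalg.norm(frame)
```

In this sketch the basis vectors play the role of side information that a decoder would need in order to map the dominant signals back to the HOA channels. How that side information is quantized, transmitted, or predicted from a previous frame is outside what the sketch shows.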
Authors:
Namazi, Mahmoud; Elshafiy, Ahmed; Rose, Kenneth
Affiliation:
University of California, Santa Barbara, CA, USA
AES Conference:
2022 AES International Conference on Audio for Virtual and Augmented Reality (August 2022)
Paper Number:
28
Publication Date:
August 15, 2022
Subject:
Paper