Community

AES Conference Papers Forum

Spatial Audio Compression with Adaptive Singular Value Decomposition Using Reconstructed Frames

Document Thumbnail

MPEG-H 3D Audio is the current standard for the compression of higher-order ambisonics data. It uses singular value decomposition (SVD) to spatially decorrelate higher-order ambisonics data, followed by the modified discrete cosine transform to exploit temporal decorrelation. Prominent and ambient sound components are then separately encoded (e.g., using the standard core audio codec) and sent to the decoder. Significant improvements in bitrate and audio quality have been gained in earlier work over MPEG-H by applying the SVD operation in the frequency domain rather than the ambisonics domain. In this work, we provide additional compression gains by adaptively calculating and extending the set of SVD basis vectors, at negligible increase in side information cost, using information attained from the previously reconstructed frame. Objective and subjective results provide evidence for higher compression gains when compared to existing methods.

Authors:
Affiliation:
AES Conference:
Paper Number:
Publication Date:
Subject:

Click to purchase paper as a non-member or you can login as an AES member to see more options.

No AES members have commented on this paper yet.

Subscribe to this discussion

RSS Feed To be notified of new comments on this paper you can subscribe to this RSS feed. Forum users should login to see additional options.

Start a discussion!

If you would like to start a discussion about this paper and are an AES member then you can login here:
Username:
Password:

If you are not yet an AES member and have something important to say about this paper then we urge you to join the AES today and make your voice heard. You can join online today by clicking here.

AES - Audio Engineering Society