For object-based audio, an appropriate definition of metadata is needed to ensure flexible playback in any reproduction scenario and to allow for interactivity. Important use cases for object-based audio and audio interactivity are described, and metadata requirements are derived. A metadata scheme is defined that allows for enhanced audio rendering techniques such as content-dependent processing, automatic scene scaling, and enhanced level control. A metadata preprocessing logic is also proposed that prepares rendering and playout and allows for user interaction with the audio content of an object-based scene. In addition, the paper points out how the metadata can be transported efficiently in a bitstream. The proposed metadata scheme has been adopted and integrated into the currently finalized MPEG-H 3D Audio standard.
Authors:
Füg, Simone; Hölzer, Andreas; Borß, Christian; Ertel, Christian; Kratschmer, Michael; Plogsties, Jan
Affiliation:
Fraunhofer Institute for Integrated Circuits IIS, Erlangen, Germany
AES Convention:
137 (October 2014)
Paper Number:
9097
Publication Date:
October 8, 2014
Subject:
Spatial Audio