Decomposing an arbitrary audio signal into direct and diffuse components is useful for applications such as spatial audio coding, spatial format conversion, binaural rendering, and spatial audio enhancement. This paper describes direct-diffuse decomposition methods for multichannel signals using a linear system of pairwise correlation estimates. The expected value of a correlation coefficient is analytically derived from a signal model with known direct and diffuse energy levels. It is shown that a linear system can be constructed from pairwise correlation coefficients to derive estimates of the Direct Energy Fraction (DEF) for each channel of a multichannel signal. Two direct-diffuse decomposition methods are described that utilize the DEF estimates within a time-frequency analysis-synthesis framework.
Authors:
Thompson, Jeffrey; Smith, Brandon; Warner, Aaron; Jot, Jean-Marc
Affiliation:
DTS, Inc., Calabasas, CA, USA
AES Convention:
133 (October 2012)
Paper Number:
8807
Publication Date:
October 25, 2012
Subject:
Spatial Audio Processing
Click to purchase paper as a non-member or you can login as an AES member to see more options.
No AES members have commented on this paper yet.
To be notified of new comments on this paper you can subscribe to this RSS feed. Forum users should login to see additional options.
If you are not yet an AES member and have something important to say about this paper then we urge you to join the AES today and make your voice heard. You can join online today by clicking here.