We present an investigation into signal processing models appropriate for audio, and especially high quality musical signals, by means of Bayesian atomic decompositions. At present, many models rely on short-term stationarity of the audio, or highly limiting forms of non-stationarity. Moreover, they are well-suited only to low-level inference tasks. We seek to formulate a new generation of audio models that will address the main limitations of the existing ones and permit high-level inference. As we show, such models result from the marriage of an overcomplete dictionary of time-frequency atoms with structured hierarchical prior probability distributions developed specifically for audio signals, in order to model coefficient correlation in time and frequency.
Authors:
Wolfe, Patrick J.; Godsill, Simon J.; Ng, Wee Jing; Doerfler, Monika
Affiliations:
Department of Engineering, University of Cambridge,Cambridge, UK ; Institute for Mathematics, University of Vienna, Strudlhofgasse, Vienna, Austria(See document for exact affiliation information.)
AES Convention:
112 (April 2002)
Paper Number:
5624
Publication Date:
April 1, 2002
Subject:
Recording, Restoration, and Processing
Click to purchase paper as a non-member or you can login as an AES member to see more options.
No AES members have commented on this paper yet.
To be notified of new comments on this paper you can subscribe to this RSS feed. Forum users should login to see additional options.
If you are not yet an AES member and have something important to say about this paper then we urge you to join the AES today and make your voice heard. You can join online today by clicking here.