This paper describes the adaptation of Efros & Leung's pixel-based Image Texture Synthesis (ITS) to 1-D for Sound Texture Synthesis (STS). The goal is the creation of a long, dynamic, sound "texture" from a much shorter audio training example. The Dual-Tree Complex Wavelet Transform (DT-CWT) is used for optimization, to good effect. We define the concept of High Resolution Sound Texture Synthesis (HR-STS) as the texturing of high resolution, multi-channel sound recordings with retention of stereophonic effects. HR-STS is useful for installations, computer games, audio repair and low-bandwidth media devices. We test a variety of real-world training examples including ambient sounds, speech snippets and music. The resulting sound textures are plausible and varied without sounding "tiled"' from the training examples.
Authors:
Kokaram, Anil; O'Regan, Deirdre
Affiliation:
Trinity College Dublin
AES Conference:
31st International Conference: New Directions in High Resolution Audio (June 2007)
Paper Number:
17
Publication Date:
June 1, 2007
Subject:
High Resolution Audio
Click to purchase paper as a non-member or you can login as an AES member to see more options.
No AES members have commented on this paper yet.
To be notified of new comments on this paper you can subscribe to this RSS feed. Forum users should login to see additional options.
If you are not yet an AES member and have something important to say about this paper then we urge you to join the AES today and make your voice heard. You can join online today by clicking here.