The ability to perceptually modify drum recording parameters in a post-recording process would be of great benefit to engineers limited by time or equipment. In this work, a data-driven approach to post-recording modification of the dampening and microphone positioning parameters commonly associated with snare drum capture is proposed. The system consists of a deep encoder that analyzes audio input and predicts optimal parameters of one or more third-party audio effects, which are then used to process the audio and produce the desired transformed output audio. Furthermore, two novel audio effects are specifically developed to take advantage of the multiple parameter learning abilities of the system. Perceptual quality of transformations is assessed through a subjective listening test, and an objective evaluation is used to measure system performance. Results demonstrate a capacity to emulate snare dampening; however, attempts were not successful in emulating microphone position changes.
Authors:
Cheshire, Matthew; Drysdale, Jake; Enderby, Sean; Tomczak, Maciej; Hockman, Jason
Affiliation:
Sound and Music Analysis Group (SoMA), Digital Media Technology Lab (DMT Lab), School of Computing and Digital Technology, Birmingham City University, Birmingham, United Kingdom
JAES Volume 70 Issue 9 pp. 742-752; September 2022
Publication Date:
September 12, 2022
Download Now (676 KB)
This paper is Open Access which means you can download it for free.
No AES members have commented on this paper yet.
To be notified of new comments on this paper you can
subscribe to this RSS feed.
Forum users should login to see additional options.
If you are not yet an AES member and have something important to say about this paper then we urge you to join the AES today and make your voice heard. You can join online today by clicking here.