Algorithms in speech and audio applications are often tested under adverse conditions to assess their robustness against additive noise. This paper describes a method for generating artificial but perceptually plausible acoustic disturbances, providing a controlled and repeatable context for evaluating algorithms. The method allows noise parameters such as coloration, modulation, and amplitude distribution to be controlled independently of each other, and also provides a means of defining the degree of coherence among the signal channels. Results of a listening test in a monaural setup show no significant difference in naturalness between the synthesized and original signals. It is not always obvious how to create natural-sounding noise. For example, white Gaussian noise was often observed to be an inappropriate choice. Frequency-dependent modulations on a short time scale appear to contribute to naturalness. Synthesizing vinyl/shellac noise, which has a particular impulsive character, requires a dedicated synthesis approach. Rain and applause proved challenging to synthesize.
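To make the parameters named in the abstract more concrete, the following is a minimal sketch of independently controllable coloration, modulation, and inter-channel similarity for synthetic noise. It is not the authors' method: the function names, the 1/f^alpha spectral shaping, the single broadband sinusoidal envelope (the abstract refers to frequency-dependent modulations), and the shared/independent mixing used as a stand-in for coherence are all assumptions chosen for illustration. Control of the amplitude distribution is not covered.

```python
import numpy as np


def colored_noise(n, alpha, rng):
    """Gaussian noise with power spectrum roughly proportional to 1/f**alpha.

    alpha = 0 gives white noise; larger alpha tilts energy toward low
    frequencies. (Hypothetical helper, not taken from the paper.)
    """
    spectrum = np.fft.rfft(rng.standard_normal(n))
    freqs = np.fft.rfftfreq(n)
    gain = np.zeros_like(freqs)          # DC bin stays at zero gain
    gain[1:] = freqs[1:] ** (-alpha / 2.0)
    x = np.fft.irfft(spectrum * gain, n)
    return x / np.std(x)                 # normalize to unit variance


def modulate(x, fs, rate_hz, depth):
    """Apply a broadband sinusoidal amplitude envelope (a simplification)."""
    t = np.arange(len(x)) / fs
    envelope = 1.0 + depth * np.sin(2.0 * np.pi * rate_hz * t)
    return x * envelope


def two_channel(n, alpha, mix, rng):
    """Two channels sharing a common component.

    mix in [0, 1] sets the expected correlation between the channels:
    0 gives independent channels, 1 gives identical channels.
    """
    shared = colored_noise(n, alpha, rng)
    channels = []
    for _ in range(2):
        own = colored_noise(n, alpha, rng)
        channels.append(np.sqrt(mix) * shared + np.sqrt(1.0 - mix) * own)
    return np.stack(channels)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    fs = 16000                            # assumed sample rate in Hz
    noise = two_channel(fs, alpha=1.0, mix=0.5, rng=rng)
    noise = np.stack([modulate(ch, fs, rate_hz=4.0, depth=0.3) for ch in noise])
    print(noise.shape)
```

Because each stage is a separate function, the spectral tilt, modulation depth, and channel similarity can be varied one at a time, which mirrors the independence of parameters that the abstract emphasizes.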
Authors:
Adrian, Jens-Alrik; Gerkmann, Timo; van de Par, Steven; Bitzer, Joerg
Affiliations:
Institute for Hearing Technology and Audiology, Jade University of Applied Sciences, Oldenburg, Germany; Signal Processing, Department of Informatics, Universität Hamburg, Germany; Acoustics Group, Cluster of Excellence (See document for exact affiliation information.)
JAES Volume 65 Issue 11 pp. 914-928; November 2017
Publication Date:
November 28, 2017