The quality of audio recordings is often degraded by various types of disturbances, such as broadband noise, hum, clicks, and crackles. Of these, broadband noise is one of the most frequently occurring types of disturbance, especially in old recordings. Disturbances can be classified as having either a technical or acoustic origin. This research presents a novel algorithm to estimate the power spectral density (PSD) of stationary broadband noise disturbances in audio recordings. The proposed algorithm estimates the noise PSD as the mean value of an exponential distribution that corresponds to the truncated periodogram coefficients of the disturbed audio signal. A confidence value is computed to reflect the reliability of the noise PSD estimate. Noise PSD estimates with a low confidence are rejected in order to avoid degrading the desired signal when the obtained noise PSD estimate is used in a noise-reduction algorithm. Based on experiments with a large database of clean speech and music signals and different artificial and real-world broadband noise disturbances, the results show that the proposed algorithm yields reduced PSD estimation errors compared to the state-of-the-art minimum statistics algorithm for a large range of SNRs. The algorithm allows for unsupervised operation and thus constitutes an important part of a fully automatic broadband noise restoration system for audio archives.
Brandt, Matthias; Doclo, Simon; Bitzer, Joerg
Affiliations: University of Oldenburg, Dept. of Medical Physics and Acoustics and Cluster of Excellence Hearing4all, Oldenburg, Germany; Jade University of Applied Sciences, Oldenburg, Germany(See document for exact affiliation information.)
JAES Volume 67 Issue 1/2 pp. 38-53; January 2019
Publication Date: January 31, 2019
No AES members have commented on this paper yet.
If you are not yet an AES member and have something important to say about this paper then we urge you to join the AES today and make your voice heard. You can join online today by clicking here.