Since early perceptual audio coders such as mp3, the underlying psychoacoustic model that controls the encoding process has not undergone many dramatic changes. Meanwhile, modern audio coders have been equipped with semi-parametric or parametric coding tools such as audio bandwidth extension. Thereby, the initial psychoacoustic model used in a perceptual coder, just considering added quantization noise, became partly unsuitable. We propose the use of an improved psychoacoustic excitation model based on an existing model proposed by Dau et al. in 1997. This modulation-based model is essentially independent from the input waveform by calculating an internal auditory representation. Using the example of MPEG-H 3D Audio and its semi-parametric Intelligent Gap Filling (IGF) tool, we demonstrate that we can successfully control the IGF parameter selection process to achieve overall improved perceptual quality.
Authors:
Disch, Sascha; van de Par, Steven; Niedermeier, Andreas; Burdiel Pérez, Elena; Berasategui Ceberio, Ane; Edler, Bernd
Affiliations:
University of Oldenburg, Oldenburg, Germany; Fraunhofer Institute for Integrated Circuits IIS, Erlangen, Germany; Friedrich Alexander University, International Audio Laboratories Erlangen, Erlangen, Germany(See document for exact affiliation information.)
AES Convention:
145 (October 2018)
Paper Number:
10029
Publication Date:
October 7, 2018
Subject:
Perception – Part 1
Click to purchase paper as a non-member or you can login as an AES member to see more options.
No AES members have commented on this paper yet.
To be notified of new comments on this paper you can subscribe to this RSS feed. Forum users should login to see additional options.
If you are not yet an AES member and have something important to say about this paper then we urge you to join the AES today and make your voice heard. You can join online today by clicking here.