The spectral distribution of music audio has an important influence on listener perception, but large-scale characterizations are lacking. Therefore, the long-term average spectrum (LTAS) was analyzed for a large dataset of popular music. The mean LTAS was computed, visualized, and then approximated with two quadratic fittings. The fittings were subsequently used to derive the spectrum slope. By applying harmonic/percussive source separation, the relationship between LTAS and percussive prominence was investigated. A clear relationship was found; tracks with more percussion have a relatively higher LTAS in the bass and high frequencies. We show how this relationship can be used to improve targets in automatic equalization. Furthermore, we assert that variations in LTAS between genres is mainly a side-effect of percussive prominence.
Elowsson, Anders; Friberg, Anders
Affiliation: KTH Royal Institute of Technology, Stockholm, Sweden
AES Convention: 142 (May 2017) Paper Number: 9762
Publication Date: May 11, 2017
Subject: Audio Analysis and Synthesis
No AES members have commented on this paper yet.
If you are not yet an AES member and have something important to say about this paper then we urge you to join the AES today and make your voice heard. You can join online today by clicking here.