In this paper, we propose a time-frequency representation in which the frequency bins are distributed uniformly in log-frequency and their Q-factors obey a linear function of the bin center frequencies. The latter allows for time-frequency representations in which the bandwidths can be, for example, constant on the log-frequency scale (constant Q) or constant on the auditory critical-band scale (smoothly varying Q). The proposed techniques are published as a Matlab toolbox that extends [3]. In addition to the features inherited from [3], namely perfect reconstruction and computational efficiency, we propose a technique for computing coefficient phases in a way that makes their interpretation more natural. Other extensions include flexible control of the Q values and more regular sampling of the time-frequency plane, which simplifies signal processing in the transform domain.
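As a rough illustration of the bin layout described in the abstract, the following Python sketch places center frequencies uniformly in log-frequency and derives Q-factors from a bandwidth rule. The rule B_k = alpha * f_k + gamma, the function names, and the parameter values are assumptions made for this example only; they are not taken from the paper or from the Matlab toolbox. Under this assumed rule, gamma = 0 gives a constant Q of 1/alpha, while gamma > 0 gives a Q that varies smoothly with frequency.

```python
import math


def log_spaced_centers(f_min, f_max, bins_per_octave):
    """Center frequencies spaced uniformly in log-frequency (geometric spacing)."""
    n_bins = int(math.floor(bins_per_octave * math.log2(f_max / f_min))) + 1
    return [f_min * 2.0 ** (k / bins_per_octave) for k in range(n_bins)]


def bandwidths(centers, alpha, gamma=0.0):
    """Assumed bandwidth rule (hypothetical): B_k = alpha * f_k + gamma, in Hz."""
    return [alpha * f + gamma for f in centers]


def q_factors(centers, bws):
    """Q-factor of each bin: center frequency divided by bandwidth."""
    return [f / b for f, b in zip(centers, bws)]


# Illustrative values only: 12 bins per octave from 55 Hz to 880 Hz.
bins_per_octave = 12
centers = log_spaced_centers(55.0, 880.0, bins_per_octave)

# Illustrative choice of alpha: each bandwidth equals the distance between
# the two neighbouring center frequencies, f_{k+1} - f_{k-1}.
alpha = 2.0 ** (1.0 / bins_per_octave) - 2.0 ** (-1.0 / bins_per_octave)

# gamma = 0: Q is the same for every bin (constant Q).
q_constant = q_factors(centers, bandwidths(centers, alpha))

# gamma > 0 (25 Hz is an arbitrary offset): Q grows smoothly with frequency.
q_varying = q_factors(centers, bandwidths(centers, alpha, gamma=25.0))
```

With a positive offset the low-frequency bins become relatively wider, which is the qualitative behaviour of a smoothly varying Q; matching an actual auditory scale would require values of alpha and gamma that this sketch does not attempt to supply.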
Authors:
Schörkhuber, Christian; Klapuri, Anssi; Holighaus, Nicki; Dörfler, Monika
Affiliations:
Austrian Academy of Sciences, Vienna, Austria; Tampere University of Technology, Tampere, Finland; University of Music and Performing Arts Graz, Graz, Austria; University of Vienna, Vienna, Austria (See document for exact affiliation information.)
AES Conference:
53rd International Conference: Semantic Audio (January 2014)
Paper Number:
P2-5
Publication Date:
January 27, 2014
Subject:
Audio Signal Processing and Feature Extraction