We present a framework for audio analysis and the extraction of low-level features, mid-level structures and high-level concepts, altogether studied as a fully interwoven complex system. Composite operations are constructed via an intuitive programming language on top of Matlab. Datasets of any size can be processed thanks to implicit memory management mechanisms. The data structure enables a tight articulation between signal and symbolic layers in a unified framework. The resulting technology can be used as a pedagogical tool for the understanding of audio, speech and musical processes and concepts, and for content-based discovery of digital libraries. Other applications includes intelligent browsing and structuring of digital library, information retrieval, and the design of content-based audio interfaces.
Author:
Lartillot, Olivier
Affiliation:
University of Jyväskylä, Finland
AES Convention:
130 (May 2011)
Paper Number:
8375
Publication Date:
May 13, 2011
Subject:
Audio Content Management
Click to purchase paper as a non-member or you can login as an AES member to see more options.
No AES members have commented on this paper yet.
To be notified of new comments on this paper you can subscribe to this RSS feed. Forum users should login to see additional options.
If you are not yet an AES member and have something important to say about this paper then we urge you to join the AES today and make your voice heard. You can join online today by clicking here.