New interactive music services have emerged, despite currently using proprietary file formats. Having a standardized file format could benefit the interoperability between these services. In this regard, the ISO/IEC Moving Picture Experts Group (MPEG) issued a new standard, the so called, MPEG-A: Interactive Music Application Format (IM AF). The purpose of this paper is to describe the design and implementation of an IM AF codec and its integration into Sonic Visualiser. In this way, the visualization of the chords or the pitch of the main melody aligned in time with the song's lyrics is achieved. Furthermore, this integration provides the semantic audio research community with a test-bed for further development and comparison of new Sonic Visualiser VAMP plug-ins, e.g., for the conversion of singing voice to text and/or automatic highlighting of lyrics for karaoke applications.
Authors:
García, Jesús; Taglialatela, Costantino; Kudumakis, Panos; Barbancho, Isabel; Tardon, Lorenzo; Sandler, Mark
Affiliations:
Queen Mary University of London, London, UK; Seconda Universita Degli Studi Di Napoli, Naples, Italy; Universidad de Málaga, Malaga, Spain(See document for exact affiliation information.)
AES Conference:
53rd International Conference: Semantic Audio (January 2014)
Paper Number:
P1-11
Publication Date:
January 27, 2014
Subject:
Audio Signal Processing and Feature Extraction
Click to purchase paper as a non-member or you can login as an AES member to see more options.
No AES members have commented on this paper yet.
To be notified of new comments on this paper you can subscribe to this RSS feed. Forum users should login to see additional options.
If you are not yet an AES member and have something important to say about this paper then we urge you to join the AES today and make your voice heard. You can join online today by clicking here.