Community

AES Engineering Briefs Forum

Considerations for the Next Generation of Singing Tutor Systems

Recently software systems have been proposed to accelerate the progress of singing beginners. The basics of these systems are: the pitch of the sung notes is detected and algorithmic errors removed. Then, an alignment is made with a melodic ground truth, often as a midi representation, using techniques including Dynamic Time Warping and Hidden Markov Models. Although results have been reasonable, significant drawbacks to these alignment schemes include how a “musically acceptable” alignment can be identified, dynamic singer behavior, multiple repeated notes, and dealing with omitted or extra notes. To this end an improved singing analysis system structure is proposed that includes psychoacoustic models and intelligent decision making. Justification is given along with a description of a structured evaluation procedure.

Authors: Faghih, Behnam; Timoney, Joseph
Affiliation: Maynooth University, Maynooth, Kildare, Ireland
AES Convention: 146 (March 2019) eBrief:506
Publication Date: March 10, 2019
Subject: Microphones and Circuits

Click to purchase paper as a non-member or you can login as an AES member to see more options.

No AES members have commented on this paper yet.

Subscribe to this discussion

To be notified of new comments on this paper you can subscribe to this RSS feed. Forum users should login to see additional options.

Start a discussion!

If you are not yet an AES member and have something important to say about this paper then we urge you to join the AES today and make your voice heard. You can join online today by clicking here.

Navigation

AES Engineering Briefs Forum

Considerations for the Next Generation of Singing Tutor Systems

Subscribe to this discussion

Start a discussion!

ABOUT AES

Contact Us