
AES Convention Papers Forum

Generating melodic dictations using Markov Chains and LSTM neural networks


Melodic dictations are aural training exercises that require students to transcribe a melody they hear into musical notation. In this paper, we propose three algorithms that generate single-voice melodies that could serve as melodic dictations. The first algorithm uses a higher-order Markov chain model to generate melodic patterns based on a given training set of dictations. The second algorithm employs a neural network with Long Short-Term Memory (LSTM) layers and the Bahdanau attention mechanism. The third algorithm generates melodies by choosing each note randomly. We analyzed the generated dictations using a dissimilarity index based on cross-correlation to demonstrate that the algorithms generate novel and diverse melodic dictations. To evaluate the musical quality of the melodies, we conducted a survey in which professional music theory teachers graded both the dictations from the training set and those generated by the algorithms. The results indicate that some of the generated dictations are comparable in quality to those in the training set and could find applications in music education.
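As an illustration of the first and third approaches described above, the following is a minimal sketch, not the authors' implementation. It assumes melodies are encoded as lists of MIDI pitch numbers and ignores rhythm entirely; the function names (`train_markov`, `generate_markov`, `generate_random`), the order of 2, and the toy `TRAINING` data are all hypothetical choices made for the example.

```python
import random
from collections import defaultdict

# Toy training set: hypothetical MIDI pitch sequences, not data from the paper.
TRAINING = [
    [60, 62, 64, 65, 67, 65, 64, 62, 60],
    [60, 64, 67, 64, 60, 62, 64, 62, 60],
]

def train_markov(melodies, order=2):
    """Map each context of `order` consecutive notes to the notes that followed it."""
    transitions = defaultdict(list)
    for melody in melodies:
        for i in range(len(melody) - order):
            context = tuple(melody[i:i + order])
            transitions[context].append(melody[i + order])
    return dict(transitions)

def generate_markov(transitions, length, rng):
    """Sample a melody by repeatedly drawing a successor of the most recent context."""
    context = rng.choice(sorted(transitions))  # random starting context
    order = len(context)
    melody = list(context)
    while len(melody) < length:
        successors = transitions.get(tuple(melody[-order:]))
        if not successors:  # context was never continued in the training data
            break
        # Repeated successors in the list make frequent continuations more likely.
        melody.append(rng.choice(successors))
    return melody[:length]

def generate_random(pitches, length, rng):
    """Baseline: choose every note independently and uniformly at random."""
    return [rng.choice(pitches) for _ in range(length)]

if __name__ == "__main__":
    rng = random.Random(0)
    table = train_markov(TRAINING, order=2)
    print(generate_markov(table, 8, rng))
    print(generate_random(sorted({p for m in TRAINING for p in m}), 8, rng))
```

A higher `order` makes the output follow the training melodies more closely, at the cost of fewer distinct continuations per context; the random baseline has no such constraint, which is why it is useful as a point of comparison.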




AES - Audio Engineering Society