Aknowledgments

We would like the thank Erica Cooper, among the authors from “Text-to-speech Synthesis techniques for MIDI-to-Audio Synthesis” (in Proc. of the 11th ISCA Speech Synthesis Workshop, 2021) for kindly sharing their samples.

This work was supported by European Union’s Horizon 2020 research and innovation programme under grant number 951911 - AI4Media.