Attempts to use generative models for music generation have been common in recent years, and some of them have achieved good results. Pieces generated by some of these models are almost indistinguishable from those being composed by human composers. However, the research on the evaluation system for machine-generated music is still at a relatively early stage, and there is no uniform standard for such tasks. This paper proposes a stacked-LSTM binary classifier based on a language model, which can be used to distinguish the human composer's work from the machine-generated melody by learning the MIDI file's pitch, position, and duration.
翻译:近年来,人们经常尝试为音乐创作使用基因化模型,其中一些已经取得了良好的成果。其中一些模型产生的碎片几乎与由人类作曲家组成的模型几乎无法区分。然而,关于机器制作音乐的评价系统的研究仍处于相对早期阶段,对此类任务没有统一的标准。本文提议以语言模型为基础建立一个堆叠式的LSTM二进制分类器,通过学习 MIDI 文件的阵列、位置和持续时间,可以用来区分人类作曲家的工作和机器制作的旋律。