Algorithmic music composition is a way of composing musical pieces with minimal to no human intervention. While recurrent neural networks are traditionally applied to many sequence-to-sequence prediction tasks, including successful implementations of music composition, their standard supervised learning approach based on input-to-output mapping leads to a lack of note variety. These models can therefore be seen as potentially unsuitable for tasks such as music generation. Generative adversarial networks learn the generative distribution of data and lead to varied samples. This work implements and compares adversarial and non-adversarial training of recurrent neural network music composers on MIDI data. The resulting music samples are evaluated by human listeners, their preferences recorded. The evaluation indicates that adversarial training produces more aesthetically pleasing music.
翻译:虽然经常神经网络传统上用于许多顺序到顺序的预测任务,包括成功实施音乐构成,但其基于投入到产出绘图的标准监督学习方法导致缺乏批注多样性,因此,这些模型可被视为可能不适合诸如音乐制作等任务; 生成对抗性网络学习数据的基因分布,并导致各种样本; 这项工作执行并比较对经常性神经网络音乐作曲家关于MIDI数据的对抗性和非对抗性培训; 由此产生的音乐样本由人类听众评价,记录他们的喜好。 评价表明,对抗性培训产生更美观的音乐。