We present a novel framework for music infilling with a user-friendly interface. Infilling is the task of generating missing musical sections given the surrounding multi-track music. The proposed transformer-based framework is extensible to new control tokens, as shown by the tokens added in this work, such as tonal tension per bar and track polyphony level. We explore the effects of including several musically meaningful control tokens and evaluate the results using objective metrics related to pitch and rhythm. Our results demonstrate that adding control tokens helps to generate music with stronger stylistic similarity to the original music. The tokens also give the user more control over properties such as musical texture and the tonal tension of each bar, whereas previous research provided control only over track density. We provide the model in a Google Colab notebook to enable interactive generation.
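To make the infilling setup concrete, the following is a minimal sketch of how an input sequence with control tokens might be assembled. It is not the tokenization used in this work: the token spellings (`<bar>`, `<mask>`, `<tension_2>`), the function `build_infilling_input`, and the single-track simplification are all assumptions made for illustration.

```python
from typing import Dict, List

MASK = "<mask>"  # hypothetical placeholder for the bar to be generated


def build_infilling_input(
    bars: List[List[str]],
    target_bar: int,
    controls: Dict[str, int],
) -> List[str]:
    """Replace one bar with control tokens followed by a mask placeholder.

    All token names here are illustrative assumptions, not the paper's vocabulary.
    """
    if not 0 <= target_bar < len(bars):
        raise IndexError("target_bar out of range")
    # Sort the control names so the token order is deterministic.
    control_tokens = [f"<{name}_{level}>" for name, level in sorted(controls.items())]
    sequence: List[str] = []
    for index, bar in enumerate(bars):
        sequence.append("<bar>")
        if index == target_bar:
            # The model would be asked to generate this bar, conditioned on
            # the requested control levels and the surrounding bars.
            sequence.extend(control_tokens)
            sequence.append(MASK)
        else:
            sequence.extend(bar)
    return sequence


example_bars = [["C4", "E4"], ["G4", "B4"], ["A4", "C5"]]
example_input = build_infilling_input(
    example_bars, target_bar=1, controls={"tension": 2, "polyphony": 1}
)
print(example_input)
```

In this sketch, supporting a new control amounts to adding another entry to `controls`, which is one simple way to read the extensibility claim above; a real system would also need to add the corresponding tokens to the model vocabulary and training data.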