Originating in the Renaissance and burgeoning in the digital era, tablatures are a commonly used music notation system which provides explicit representations of instrument fingerings rather than pitches. GuitarPro has established itself as a widely used tablature format and software enabling musicians to edit and share songs for musical practice, learning, and composition. In this work, we present DadaGP, a new symbolic music dataset comprising 26,181 song scores in the GuitarPro format covering 739 musical genres, along with an accompanying tokenized format well-suited for generative sequence models such as the Transformer. The tokenized format is inspired by event-based MIDI encodings, often used in symbolic music generation models. The dataset is released with an encoder/decoder which converts GuitarPro files to tokens and back. We present results of a use case in which DadaGP is used to train a Transformer-based model to generate new songs in GuitarPro format. We discuss other relevant use cases for the dataset (guitar-bass transcription, music style transfer and artist/genre classification) as well as ethical implications. DadaGP opens up the possibility to train GuitarPro score generators, fine-tune models on custom data, create new styles of music, AI-powered songwriting apps, and human-AI improvisation.
翻译:发源于数字时代的文艺复兴和发起, 制表器是一种常用的音乐标记系统, 提供工具指针而不是投球的清晰表达。 吉他Pro 已经将自己确立为一种广泛使用的制表格式和软件, 使音乐家能够编辑和分享歌曲, 用于音乐练习、 学习和组成。 在这项工作中, 我们介绍了DadaGP, 这是一个新的象征性音乐数据集, 由吉他Pro 格式中的26, 181个歌曲分组成, 涵盖739个音乐元体, 以及一个配有象征式格式, 适合变异器等基因序列模型的符号化模式。 代号格式受基于事件的 MIDI 编码的启发, 通常用于象征性的音乐生成模型 。 该数据集以编码/ 将 GitarPro 文档转换成符号或背面。 我们介绍了一个使用 DadaGP 来训练基于变异器的歌曲模型以生成新的歌曲, GuitarPro 格式。 我们讨论数据设置的其他相关使用案例( 吉他- 转录、 音乐风格转移和动动动动动动动器到Diral- regial- registrateal implade- registration imateal imation imational imationalational delgistrationsal del) 可能 新的数据, 将数据转换成新的数据转换成新的数字。