音乐LM: 从文字生成音乐 (MusicLM: Generating Music From Text)

Andrea Agostinelli,Timo I. Denk,Zalán Borsos,Jesse Engel,Mauro Verzetti,Antoine Caillon,Qingqing Huang,Aren Jansen,Adam Roberts,Marco Tagliasacchi,Matt Sharifi,Neil Zeghidour,Christian Frank

from arxiv, Supplementary material at https://google-research.github.io/seanet/musiclm/examples and https://kaggle.com/datasets/googleai/musiccaps

We introduce MusicLM, a model generating high-fidelity music from text descriptions such as "a calming violin melody backed by a distorted guitar riff". MusicLM casts the process of conditional music generation as a hierarchical sequence-to-sequence modeling task, and it generates music at 24 kHz that remains consistent over several minutes. Our experiments show that MusicLM outperforms previous systems both in audio quality and adherence to the text description. Moreover, we demonstrate that MusicLM can be conditioned on both text and a melody in that it can transform whistled and hummed melodies according to the style described in a text caption. To support future research, we publicly release MusicCaps, a dataset composed of 5.5k music-text pairs, with rich text descriptions provided by human experts.

翻译：我们引入了MusicLM(MusicLM ), 音乐LM(MusicLM ), 以文字描述(比如“由扭曲的吉他支持的平息小提琴旋律 ” ) 为题材, 音乐LM(MusicLM ) 将有条件的音乐生成过程作为按顺序顺序顺序排序的建模任务, 并在24 kHz(24 kHz) 生成音乐, 持续了几分钟。我们的实验显示MusicLM(MusicLM)在音质和对文字描述的坚持性两方面都超越了先前的系统。此外, 我们证明音乐LM(M) 可以同时以文字和旋律为条件, 因为它可以按照文字说明中描述的风格转换口哨和曲调。为了支持未来的研究, 我们公开发行了由5.5k音乐文本配对组成的数据集, 由人类专家提供丰富的文字描述。

相关内容

MoDELS

关注 43

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

67页PPT【ML+气象】使用机器学习技术对季节和次季节研究和预测，Use of Machine Learning Techniques for Seasonal and Subseasonal Studies and Predictions

专知会员服务

19+阅读 · 2022年3月4日

最新《自监督表示学习》报告，70页ppt

专知会员服务

86+阅读 · 2020年12月22日

【ETH】最新《几何数据分析》2020课程，附PPT下载

专知会员服务

44+阅读 · 2020年12月18日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日