舞蹈2MIDI:由舞蹈驱动的多部门音乐创作 (Dance2MIDI: Dance-driven multi-instruments music generation) - 专知论文

会员服务 ·

0

相关系数 · Pair · 数据集 · HTTPS · Harmony ·

2023 年 1 月 22 日

Dance2MIDI: Dance-driven multi-instruments music generation

翻译：舞蹈2MIDI:由舞蹈驱动的多部门音乐创作

Dance-driven music generation aims to generate musical pieces conditioned on dance videos. Previous works focus on monophonic or raw audio generation, while the multiinstruments scenario is under-explored. The challenges of the dance-driven multi-instruments music (MIDI) generation are two-fold: 1) no publicly available multi-instruments MIDI and video paired dataset and 2) the weak correlation between music and video. To tackle these challenges, we build the first multi-instruments MIDI and dance paired dataset (D2MIDI). Based on our proposed dataset, we introduce a multi-instruments MIDI generation framework (Dance2MIDI) conditioned on dance video. Specifically, 1) to model the correlation between music and dance, we encode the dance motion using the GCN, and 2) to generate harmonious and coherent music, we employ Transformer to decode the MIDI sequence. We evaluate the generated music of our framework trained on D2MIDI dataset and demonstrate that our method outperforms existing methods. The data and code are available on https://github.com/Dance2MIDI/Dance2MIDI

翻译：由舞蹈驱动的音乐制作旨在产生以舞蹈录像为条件的音乐片段。以前的工作重点是单声或原始声频生成,而多种工具的情景则未得到充分探讨。舞蹈驱动的多工具音乐(MIDI)生成的挑战有两个方面:(1) 没有公开的多种工具MIDI和视频配对数据集,(2) 音乐和视频之间的相关性薄弱。为了应对这些挑战,我们建立了第一个多工具MIDI和舞蹈配对数据集(D2MIDI)。根据我们提议的数据集,我们引入了一个多工具MIDI生成框架(Dance2MIDI),以舞蹈视频为条件。具体来说,1)为模拟音乐与舞蹈之间的相互关系,我们用GCN对舞蹈运动进行编码,2)为产生和谐和连贯的音乐,我们使用变换器解码MIDI序列。我们评估了我们D2MIDI数据集培训的框架生成的音乐,并证明我们的方法超越了现有方法。数据和代码可以在 https://github.com/DINGIS2/DIMIMance上查阅。

0

相关内容

相关系数

自然语言处理顶会NAACL2022最佳论文出炉！

自然语言处理顶会NAACL2022最佳论文出炉！

专知会员服务

43+阅读 · 2022年6月30日

计算机科学课程与视频课件合集，Computer Science courses with video lectures

计算机科学课程与视频课件合集，Computer Science courses with video lectures

专知会员服务

37+阅读 · 2022年1月24日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

165+阅读 · 2020年3月18日

【DeepMind】基于变换的大规模数据对抗视频预测，Transformation-based Adversarial Video Prediction on Large-Scale Data

【DeepMind】基于变换的大规模数据对抗视频预测，Transformation-based Adversarial Video Prediction on Large-Scale Data

专知会员服务

17+阅读 · 2020年3月9日

【清华大学】知识增强的常识性故事生成预训练模型，A Knowledge-Enhanced Pretraining Model for Commonsense Story Generation

【清华大学】知识增强的常识性故事生成预训练模型，A Knowledge-Enhanced Pretraining Model for Commonsense Story Generation

专知会员服务

52+阅读 · 2020年1月20日

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

专知会员服务

15+阅读 · 2019年10月23日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

【深度学习视频分析/多模态学习资源大列表】

【深度学习视频分析/多模态学习资源大列表】

专知会员服务

92+阅读 · 2019年10月16日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

简评 | Video Action Recognition 的近期进展

简评 | Video Action Recognition 的近期进展

极市平台

20+阅读 · 2019年4月21日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

vae 相关论文表示学习 1

vae 相关论文表示学习 1

CreateAMind

12+阅读 · 2018年9月6日

【论文】图上的表示学习综述

【论文】图上的表示学习综述

机器学习研究会

15+阅读 · 2017年9月24日

面向微视频情感分析的多通道特征学习关键技术研究

国家自然科学基金

1+阅读 · 2013年12月31日

无穷Laplace方程解的边界正则性

国家自然科学基金

0+阅读 · 2013年12月31日

基于紧急异常声音事件检测与分类的音频监控系统方法研究

国家自然科学基金

0+阅读 · 2013年12月31日

GLP-1/beta-catenin/TCF信号通路对糖尿病鼠心肌细胞凋亡的保护作用及机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

糖尿病血管钙化的新机制：高糖诱导内皮细胞－成骨细胞转分化的研究

国家自然科学基金

0+阅读 · 2012年12月31日

有限时间时滞混沌同步及其FPGA 实现

国家自然科学基金

0+阅读 · 2012年12月31日

面向半监督数据集的智能软测量建模方法研究与应用

国家自然科学基金

1+阅读 · 2012年12月31日

开放域动态事实性信息获取及融合方法研究

国家自然科学基金

1+阅读 · 2012年12月31日

《计算机研究与发展》学术期刊

国家自然科学基金

1+阅读 · 2011年12月31日

补肾抗衰片动态调控HO-1/CO与NOS/NO系统微平衡稳定动脉粥样硬化斑块的实验研究

国家自然科学基金

0+阅读 · 2009年12月31日

Aerial-Ground Person Re-ID

Arxiv

0+阅读 · 2023年3月15日

CoNIC Challenge: Pushing the Frontiers of Nuclear Detection, Segmentation, Classification and Counting

Arxiv

0+阅读 · 2023年3月14日

Interaction Pattern Disentangling for Multi-Agent Reinforcement Learning

Arxiv

0+阅读 · 2023年3月14日

SceneRF: Self-Supervised Monocular 3D Scene Reconstruction with Radiance Fields

Arxiv

0+阅读 · 2023年3月13日

LA-VocE: Low-SNR Audio-visual Speech Enhancement using Neural Vocoders

Arxiv

1+阅读 · 2023年3月13日

Analyzing Infrastructure LiDAR Placement with Realistic LiDAR Simulation Library

Arxiv

0+阅读 · 2023年3月11日

LocPoseNet: Robust Location Prior for Unseen Object Pose Estimation

Arxiv

0+阅读 · 2023年3月10日

Text Detection and Recognition in the Wild: A Review

Arxiv

20+阅读 · 2020年6月8日

Phase-aware Speech Enhancement with Deep Complex U-Net

Phase-aware Speech Enhancement with Deep Complex U-Net

Arxiv

15+阅读 · 2019年3月7日

Deep Reinforcement Learning for List-wise Recommendations

Arxiv

13+阅读 · 2018年1月5日

VIP会员

文章信息

相关主题

相关VIP内容

自然语言处理顶会NAACL2022最佳论文出炉！

自然语言处理顶会NAACL2022最佳论文出炉！

专知会员服务

43+阅读 · 2022年6月30日

计算机科学课程与视频课件合集，Computer Science courses with video lectures

计算机科学课程与视频课件合集，Computer Science courses with video lectures

专知会员服务

37+阅读 · 2022年1月24日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

165+阅读 · 2020年3月18日

【DeepMind】基于变换的大规模数据对抗视频预测，Transformation-based Adversarial Video Prediction on Large-Scale Data

【DeepMind】基于变换的大规模数据对抗视频预测，Transformation-based Adversarial Video Prediction on Large-Scale Data

专知会员服务

17+阅读 · 2020年3月9日

【清华大学】知识增强的常识性故事生成预训练模型，A Knowledge-Enhanced Pretraining Model for Commonsense Story Generation

【清华大学】知识增强的常识性故事生成预训练模型，A Knowledge-Enhanced Pretraining Model for Commonsense Story Generation

专知会员服务

52+阅读 · 2020年1月20日

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

专知会员服务

15+阅读 · 2019年10月23日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

【深度学习视频分析/多模态学习资源大列表】

【深度学习视频分析/多模态学习资源大列表】

专知会员服务

92+阅读 · 2019年10月16日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

新书册《几何深度学习的数学基础》

中程单向攻击无人机的战略意义：俄乌战争启示

在无标注条件下适配视觉—语言模型：全面综述

面向视觉语言模型的持续学习：遗忘之外的综述与分类体系

相关资讯

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

简评 | Video Action Recognition 的近期进展

简评 | Video Action Recognition 的近期进展

极市平台

20+阅读 · 2019年4月21日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

vae 相关论文表示学习 1

vae 相关论文表示学习 1

CreateAMind

12+阅读 · 2018年9月6日

【论文】图上的表示学习综述

【论文】图上的表示学习综述

机器学习研究会

15+阅读 · 2017年9月24日

相关论文

Aerial-Ground Person Re-ID

Arxiv

0+阅读 · 2023年3月15日

CoNIC Challenge: Pushing the Frontiers of Nuclear Detection, Segmentation, Classification and Counting

Arxiv

0+阅读 · 2023年3月14日

Interaction Pattern Disentangling for Multi-Agent Reinforcement Learning

Arxiv

0+阅读 · 2023年3月14日

SceneRF: Self-Supervised Monocular 3D Scene Reconstruction with Radiance Fields

Arxiv

0+阅读 · 2023年3月13日

LA-VocE: Low-SNR Audio-visual Speech Enhancement using Neural Vocoders

Arxiv

1+阅读 · 2023年3月13日

Analyzing Infrastructure LiDAR Placement with Realistic LiDAR Simulation Library

Arxiv

0+阅读 · 2023年3月11日

LocPoseNet: Robust Location Prior for Unseen Object Pose Estimation

Arxiv

0+阅读 · 2023年3月10日

Text Detection and Recognition in the Wild: A Review

Arxiv

20+阅读 · 2020年6月8日

Phase-aware Speech Enhancement with Deep Complex U-Net

Phase-aware Speech Enhancement with Deep Complex U-Net

Arxiv

15+阅读 · 2019年3月7日

Deep Reinforcement Learning for List-wise Recommendations

Arxiv

13+阅读 · 2018年1月5日

相关基金

面向微视频情感分析的多通道特征学习关键技术研究

国家自然科学基金

1+阅读 · 2013年12月31日

无穷Laplace方程解的边界正则性

国家自然科学基金

0+阅读 · 2013年12月31日

基于紧急异常声音事件检测与分类的音频监控系统方法研究

国家自然科学基金

0+阅读 · 2013年12月31日

GLP-1/beta-catenin/TCF信号通路对糖尿病鼠心肌细胞凋亡的保护作用及机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

糖尿病血管钙化的新机制：高糖诱导内皮细胞－成骨细胞转分化的研究

国家自然科学基金

0+阅读 · 2012年12月31日

有限时间时滞混沌同步及其FPGA 实现

国家自然科学基金

0+阅读 · 2012年12月31日

面向半监督数据集的智能软测量建模方法研究与应用

国家自然科学基金

1+阅读 · 2012年12月31日

开放域动态事实性信息获取及融合方法研究

国家自然科学基金

1+阅读 · 2012年12月31日

《计算机研究与发展》学术期刊

国家自然科学基金

1+阅读 · 2011年12月31日

补肾抗衰片动态调控HO-1/CO与NOS/NO系统微平衡稳定动脉粥样硬化斑块的实验研究

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员