In this work, we define a diffusion-based generative model capable of both music synthesis and source separation by learning the score of the joint probability density of sources sharing a context. Alongside the classic total inference tasks (i.e., generating a mixture, separating the sources), we also introduce and experiment with the partial inference task of source imputation, where we generate a subset of the sources given the others (e.g., play a piano track that goes well with the drums). Additionally, we introduce a novel inference method for the separation task. We train our model on Slakh2100, a standard dataset for musical source separation, provide qualitative results in the generation settings, and showcase competitive quantitative results in the separation setting. Our method is the first example of a single model that can handle both generation and separation tasks, thus representing a step toward general audio models.
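To make the source-imputation idea concrete, here is a minimal toy sketch (not the paper's actual model or sampler) of how a score-based diffusion model over stacked sources can generate missing sources conditioned on observed ones: at each annealed-Langevin step, the observed sources are clamped to a freshly noised copy of themselves, so the generated sources stay consistent with them. The names `toy_score` and `impute_sources` are hypothetical, and the closed-form Gaussian score stands in for a trained network.

```python
import numpy as np

def toy_score(x, sigma):
    # Stand-in for a learned joint score network: the exact score of a
    # standard-normal prior perturbed with noise level sigma,
    # i.e. grad_x log N(x; 0, 1 + sigma^2).
    return -x / (1.0 + sigma ** 2)

def impute_sources(score_fn, known, known_mask, num_steps=50,
                   sigma_max=1.0, sigma_min=0.01, seed=0):
    """Annealed Langevin sampler that fills in the unobserved sources.

    known:      array (num_sources, length), valid where known_mask is True
    known_mask: boolean array of the same shape
    """
    rng = np.random.default_rng(seed)
    sigmas = np.geomspace(sigma_max, sigma_min, num_steps)
    x = sigma_max * rng.normal(size=known.shape)
    for sigma in sigmas:
        step = 0.5 * sigma ** 2  # Langevin step size tied to the noise level
        # Clamp observed sources to a noised copy at the current level, so
        # the missing sources are denoised jointly with (and conditioned on)
        # the given ones.
        x = np.where(known_mask, known + sigma * rng.normal(size=x.shape), x)
        x = x + step * score_fn(x, sigma) \
              + np.sqrt(step) * rng.normal(size=x.shape)
    # Restore the exact observations in the final output.
    return np.where(known_mask, known, x)

# Toy usage: 4 sources of 8 samples each; only the drum track (row 1) is given.
known = np.zeros((4, 8))
known[1] = 0.3
mask = np.zeros((4, 8), dtype=bool)
mask[1] = True
out = impute_sources(toy_score, known, mask)
```

The clamping trick is the standard inpainting strategy for score-based models; the paper's actual method may differ in how it conditions on the observed sources.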