含有深平衡模型的音乐源分离 (Music Source Separation with Deep Equilibrium Models)

While deep neural network-based music source separation (MSS) is very effective and achieves high performance, its model size is often a problem for practical deployment. Deep implicit architectures such as deep equilibrium models (DEQ) were recently proposed, which can achieve higher performance than their explicit counterparts with limited depth while keeping the number of parameters small. This makes DEQ also attractive for MSS, especially as it was originally applied to sequential modeling tasks in natural language processing and thus should in principle be also suited for MSS. However, an investigation of a good architecture and training scheme for MSS with DEQ is needed as the characteristics of acoustic signals are different from those of natural language data. Hence, in this paper we propose an architecture and training scheme for MSS with DEQ. Starting with the architecture of Open-Unmix (UMX), we replace its sequence model with DEQ. We refer to our proposed method as DEQ-based UMX (DEQ-UMX). Experimental results show that DEQ-UMX performs better than the original UMX while reducing its number of parameters by 30%.

翻译：虽然深神经网络的音乐源分离(MSS)非常有效,并取得了很高的性能,但其模型规模往往是一个实际部署的问题。最近提出了深平衡模型(DEQ)等深隐含结构,其性能可以高于其直线对应方,深度有限,但参数数量小。这使得DEQ对MSS也具有吸引力,特别是因为它最初应用于自然语言处理中的顺序建模任务,因此原则上也应适用于MSS。然而,由于音频信号的特征不同于自然语言数据,因此需要对具有DEQ的MSS的良好架构和培训计划进行调查。因此,在本文件中,我们提出了与DEQ的MSS的架构和培训计划。从Open-Unmix(UMX)的架构开始,我们用DEQ取代其序列模型。我们称之为基于DEQ UMX(DEQ-UMX)的序列。我们提出的方法,即以DEQ UMX(DEQ-UMX)为基础。实验结果表明,DEQ-UMX在将参数数目减少30%的同时,其表现优于原UX。

相关内容

MoDELS

关注 43

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

剑桥大学《数据科学: 原理与实践》课程，附PPT下载

专知会员服务

54+阅读 · 2021年1月20日

【Google】深度学习对抗鲁棒性，43页ppt

专知会员服务

45+阅读 · 2020年10月31日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

10+阅读 · 2019年10月24日