Music source separation has been intensively studied in the last decade and tremendous progress with the advent of deep learning could be observed. Evaluation campaigns such as MIREX or SiSEC connected state-of-the-art models and corresponding papers, which can help researchers integrate the best practices into their models. In recent years, however, it has become increasingly difficult to measure real-world performance as the music separation community had to rely on a limited amount of test data and was biased towards specific genres and mixing styles. To address these issues, we designed the Music Demixing (MDX) Challenge on a crowd-based machine learning competition platform where the task is to separate stereo songs into four instrument stems (Vocals, Drums, Bass, Other). The main differences compared with the past challenges are 1) the competition is designed to more easily allow machine learning practitioners from other disciplines to participate and 2) evaluation is done on a hidden test set created by music professionals dedicated exclusively to the challenge to assure the transparency of the challenge, i.e., the test set is not included in the training set. In this paper, we provide the details of the datasets, baselines, evaluation metrics, evaluation results, and technical challenges for future competitions.
翻译:近十年来,对音乐源的分离进行了深入研究,随着深层次学习的到来,可以观察到巨大的进展。MIREX或SISEC等评价运动与最先进的模型和相应论文相关联,有助于研究人员将最佳做法纳入模型中。然而,近年来,由于音乐分离社区不得不依赖数量有限的测试数据,偏向于特定的种类和混合风格,因此越来越难以衡量真实世界的性能。为了解决这些问题,我们设计了以人群为基础的机器学习竞争平台 " 音乐解密挑战 " (MDX),该平台的任务是将立体歌曲分为四个乐器(Vocals、Drums、Bass、Others)。与以往挑战相比的主要差异是:1) 竞争的目的是让机器学习其他学科的从业人员更容易地参与,2) 评估音乐专业工作者专门为确保挑战的透明度而创建的隐性测试,即测试组没有列入培训组。本文中,我们提供了数据集、基准、技术评估、未来评估结果的详情。