We discuss a novel task, Chorus Recognition, which could potentially benefit downstream tasks such as song search and music summarization. Different from the existing tasks such as music summarization or lyrics summarization relying on single-modal information, this paper models chorus recognition as a multi-modal one by utilizing both the lyrics and the tune information of songs. We propose a multi-modal Chorus Recognition model that considers diverse features. Besides, we also create and publish the first Chorus Recognition dataset containing 627 songs for public use. Our empirical study performed on the dataset demonstrates that our approach outperforms several baselines in chorus recognition. In addition, our approach also helps to improve the accuracy of its downstream task - song search by more than 10.6%.
翻译:我们讨论的是新颖的任务,即合唱承认,这有可能有益于下游任务,如歌曲搜索和音乐总结。不同于现有任务,如音乐总结或歌词总结,依赖单一模式信息,本文模型通过歌词和歌曲调调信息,将合唱承认为多模式。我们提出了多模式合唱承认模式,考虑到不同特点。此外,我们还创建并出版了首个合唱承认数据集,其中载有627首歌,供公众使用。我们在数据集上进行的经验研究表明,我们的方法超越了合唱承认方面的几个基线。此外,我们的方法还有助于提高其下游任务的准确性,即歌曲搜索率超过10.6%。