Modern digital music production typically involves combining numerous acoustic elements to compile a piece of music. Important types of such elements are drum samples, which determine the characteristics of the percussive components of the piece. Artists must use their aesthetic judgement to assess whether a given drum sample fits the current musical context. However, selecting drum samples from a potentially large library is tedious and may interrupt the creative flow. In this work, we explore the automatic drum sample retrieval based on aesthetic principles learned from data. As a result, artists can rank the samples in their library by fit to some musical context at different stages of the production process (i.e., by fit to incomplete song mixtures). To this end, we use contrastive learning to maximize the score of drum samples originating from the same song as the mixture. We conduct a listening test to determine whether the human ratings match the automatic scoring function. We also perform objective quantitative analyses to evaluate the efficacy of our approach.
翻译:现代数字音乐制作通常需要结合多种声学元素来编集音乐片段。这类元素的重要类型是鼓样,它决定了鼓声组件的特性。艺术家必须利用他们的审美判断来评估某个鼓样样是否符合当前的音乐环境。然而,从潜在的大型图书馆挑选鼓样是乏味的,可能会干扰创作流。在这项工作中,我们探索基于从数据中学会的审美原理的自动鼓样检索。因此,艺术家可以在制作过程的不同阶段(即适合不完全的歌曲混合物),根据某种音乐背景在图书馆中排列样品。为此,我们利用对比性学习来尽量扩大来自与混合物同首歌的鼓样的分数。我们进行监听测试,以确定人类的评分是否与自动评分功能相符。我们还进行客观的定量分析,以评价我们方法的功效。