In this paper, we focus on singing techniques within the scope of music information retrieval research. We investigate how singers use singing techniques using real-world recordings of famous solo singers in Japanese popular music songs (J-POP). First, we built a new dataset of singing techniques. The dataset consists of 168 commercial J-POP songs, and each song is annotated using various singing techniques with timestamps and vocal pitch contours. We also present descriptive statistics of singing techniques on the dataset to clarify what and how often singing techniques appear. We further explored the difficulty of the automatic detection of singing techniques using previously proposed machine learning techniques. In the detection, we also investigate the effectiveness of auxiliary information (i.e., pitch and distribution of label duration), not only providing the baseline. The best result achieves 40.4% at macro-average F-measure on nine-way multi-class detection. We provide the annotation of the dataset and its detail on the appendix website 0 .
翻译:在本文中,我们侧重于音乐信息检索研究范围内的歌唱技巧。我们调查歌手如何使用在日本流行音乐歌曲(J-POP)中名独唱歌手真实世界录音的歌唱技巧。首先,我们建立了一个关于歌唱技巧的新数据集。数据集由168个商业J-POP歌曲组成,每首歌曲都有附加注释,使用各种歌唱技巧加上时标和声声声声声声轮廓。我们还在数据集中提供歌唱技巧的描述性统计数据,以澄清歌唱技巧的出现和出现频率。我们进一步探讨了利用先前提议的机器学习技巧自动探测歌唱技巧的困难。在探测中,我们还调查辅助信息(即标签持续时间的播音和分配)的有效性,不仅提供了基线。最佳结果是在九道多级探测的宏观平均F度上达到40.4%。我们在附录网站0上提供了数据集及其细节的说明。