This paper introduces the dataset used for the data challenge organised by the Conference on Sound and Music Technology (CSMT). The CSMT data challenge requires participants to identify whether a given melody was generated by a computer or composed by a human. The dataset consists of two parts: a development dataset and an evaluation dataset. The development dataset contains only computer-generated melodies, whereas the evaluation dataset contains both computer-generated and human-composed melodies. The aim of the dataset is to examine whether computer-generated melodies can be distinguished from human-composed ones by learning the features of generated melodies.