Coherence and interestingness are two criteria for evaluating the performance of melody harmonization, which aims to generate a chord progression from a symbolic melody. In this study, we apply the concept of orderless NADE, which takes the melody and its partially masked chord sequence as the input of the BiLSTM-based networks to learn the masked ground truth, to the training process. In addition, the class weights are used to compensate for some reasonable chord labels that are rarely seen in the training set. Consistent with the stochasticity in training, blocked Gibbs sampling with proper numbers of masking/generating loops is used in the inference phase to progressively trade the coherence of the generated chord sequence off against its interestingness. The experiments were conducted on a dataset of 18,005 melody/chord pairs. Our proposed model outperforms the state-of-the-art system MTHarmonizer in five of six different objective metrics based on chord/melody harmonicity and chord progression. The subjective test results with more than 100 participants also show the superiority of our model.
翻译:一致性和趣味性是评价旋律协调性表现的两个标准,其目的是从象征性旋律中产生和弦进取。在本研究中,我们应用无序NADE的概念,将旋律及其部分遮蔽的和弦序列作为BILSTM网络的输入,以了解掩盖的地面真相。此外,班级重量还用来补偿在培训中很少见的一些合理的和弦标签。与培训中的随机性一致,在推断阶段,用适当数量的遮蔽/生成环阻碍Gibs取样,以逐步交换生成的和弦序列的连贯性,使其与它的有趣性脱钩。实验是在18 005米和弦的数据集上进行的。我们提议的模型超越了基于合音/合音调和弦进化和弦进化的六种不同客观指标中的五种。100多名参与者的主观测试结果也显示了我们模型的优越性。