Generating melody from lyrics is an interesting yet challenging task in the area of artificial intelligence and music. However, the difficulty of keeping the consistency between input lyrics and generated melody limits the generation quality of previous works. In our proposal, we demonstrate our proposed interpretable lyrics-to-melody generation system which can interact with users to understand the generation process and recreate the desired songs. To improve the reliability of melody generation that matches lyrics, mutual information is exploited to strengthen the consistency between lyrics and generated melodies. Gumbel-Softmax is exploited to solve the non-differentiability problem of generating discrete music attributes by Generative Adversarial Networks (GANs). Moreover, the predicted probabilities output by the generator is utilized to recommend music attributes. Interacting with our lyrics-to-melody generation system, users can listen to the generated AI song as well as recreate a new song by selecting from recommended music attributes.
翻译:从歌词中生成旋律是人工智能和音乐领域一项有趣但富有挑战性的任务。 然而,保持投入歌词的一致性和生成旋律的难度限制了以往作品的生成质量。 在我们的提案中,我们展示了我们提议的可解释的歌词至融和生成系统,该系统可以与用户互动,以理解生成过程并重创想要的歌曲。为了提高旋律生成的可靠性,匹配歌词,相互信息被用来加强歌词和生成旋律的一致性。 Gumbel-Softmax被用于解决由Generation Aversarial Networks(GANs)生成离散音乐属性的非差异性问题。此外,发电机的预测概率输出被用于推荐音乐属性。与我们的歌词至融和生成系统互动,用户可以聆听生成的 AI 歌曲,并通过从推荐的音乐属性中选择再创建一首新歌。