使用音频化预培训语言模型翻译歌曲歌词 (Interpreting Song Lyrics with an Audio-Informed Pre-trained Language Model) - 专知论文

会员服务 ·

0

语言模型化 · 可理解性 · MoDELS · 原点 · INFORMS ·

2022 年 8 月 24 日

Interpreting Song Lyrics with an Audio-Informed Pre-trained Language Model

翻译：使用音频化预培训语言模型翻译歌曲歌词

Yixiao Zhang,Junyan Jiang,Gus Xia,Simon Dixon

from arxiv, Accepted to ISMIR 2022

Lyric interpretations can help people understand songs and their lyrics quickly, and can also make it easier to manage, retrieve and discover songs efficiently from the growing mass of music archives. In this paper we propose BART-fusion, a novel model for generating lyric interpretations from lyrics and music audio that combines a large-scale pre-trained language model with an audio encoder. We employ a cross-modal attention module to incorporate the audio representation into the lyrics representation to help the pre-trained language model understand the song from an audio perspective, while preserving the language model's original generative performance. We also release the Song Interpretation Dataset, a new large-scale dataset for training and evaluating our model. Experimental results show that the additional audio information helps our model to understand words and music better, and to generate precise and fluent interpretations. An additional experiment on cross-modal music retrieval shows that interpretations generated by BART-fusion can also help people retrieve music more accurately than with the original BART.

翻译：流言解释可以帮助人们快速理解歌曲和歌词,还可以使人们更容易从越来越多的音乐档案库中有效地管理、检索和发现歌曲。在本文中,我们提议了BART-sult,这是从歌词和音乐音频中产生歌词解释的新模式,将大型的预培训语言模型与音调编码器结合起来。我们使用一个跨模式关注模块,将音频表述纳入歌词表述中,以帮助经过培训的语文模型从音频角度理解歌曲,同时保留语言模型的原始发型性能。我们还发布了歌曲解释数据集,这是用于培训和评估模型的新的大规模数据集。实验结果显示,补充音频信息有助于我们的模型更好地了解文字和音乐,并产生准确和流畅的解释。关于跨模式的音乐检索的额外实验显示,由BART-volution产生的解释也能够帮助人们比原始BART更准确地检索音乐。

0

相关内容

语言模型化

语言模型化

NeurlPS 2022 | 自然语言处理相关论文分类整理

NeurlPS 2022 | 自然语言处理相关论文分类整理

专知会员服务

51+阅读 · 2022年10月2日

最新《自监督表示学习》报告，70页ppt

最新《自监督表示学习》报告，70页ppt

专知会员服务

86+阅读 · 2020年12月22日

迁移学习简明教程，11页ppt

迁移学习简明教程，11页ppt

专知会员服务

108+阅读 · 2020年8月4日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

专知会员服务

15+阅读 · 2019年10月23日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

TensorFlow 2.0 学习资源汇总

TensorFlow 2.0 学习资源汇总

专知会员服务

67+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

【ICIG2021】Latest News & Announcements of the Workshop

【ICIG2021】Latest News & Announcements of the Workshop

中国图象图形学学会CSIG

0+阅读 · 2021年12月20日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

中国图象图形学学会CSIG

0+阅读 · 2021年11月8日

会议交流 | IJCKG: International Joint Conference on Knowledge Graphs

会议交流 | IJCKG: International Joint Conference on Knowledge Graphs

开放知识图谱

0+阅读 · 2021年9月9日

【ICIG2021】Latest News & Announcements of the Industry Talk1

【ICIG2021】Latest News & Announcements of the Industry Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年7月28日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

互联网与数学文化传播研讨会

国家自然科学基金

1+阅读 · 2018年9月23日

救必应黄酮类成分抗产ESBLs大肠杆菌作用机理研究

国家自然科学基金

0+阅读 · 2014年12月31日

基于代谢组学的“附子理中汤”治疗脾虚证的药效物质基础与作用机理研究

国家自然科学基金

0+阅读 · 2013年12月31日

垄断及双寡头市场条件下企业级软件交付模式的研究

国家自然科学基金

2+阅读 · 2013年12月31日

MicroRNA-10a/b靶向调控ABCA1和ABCG1对胆固醇流出的影响

国家自然科学基金

0+阅读 · 2013年12月31日

Ghrelin对牛卵母细胞体外成熟的调控机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于RAGE靶标研究生地山茱萸环烯醚萜苷类成分干预糖尿病肾病的分子机制

国家自然科学基金

0+阅读 · 2012年12月31日

汉越双语语料库建设及词对齐方法研究

国家自然科学基金

0+阅读 · 2012年12月31日

格构式高耸结构基于大位移和稳定性的抗风优化设计方法研究

国家自然科学基金

0+阅读 · 2012年12月31日

PCOS患者卵巢颗粒细胞对卵子及早期胚胎发育潜能的基因调控

国家自然科学基金

0+阅读 · 2011年12月31日

Unsupervised Sentence Textual Similarity with Compositional Phrase Semantics

Arxiv

0+阅读 · 2022年10月5日

And what if two musical versions don't share melody, harmony, rhythm, or lyrics ?

Arxiv

0+阅读 · 2022年10月3日

BVI-VFI: A Video Quality Database for Video Frame Interpolation

Arxiv

0+阅读 · 2022年10月3日

SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model

Arxiv

0+阅读 · 2022年10月3日

PoLyScriber: Integrated Training of Extractor and Lyrics Transcriber for Polyphonic Music

Arxiv

0+阅读 · 2022年10月2日

Building for Tomorrow: Assessing the Temporal Persistence of Text Classifiers

Arxiv

0+阅读 · 2022年10月1日

Language Models Can Teach Themselves to Program Better

Arxiv

0+阅读 · 2022年9月30日

Sequence Level Contrastive Learning for Text Summarization

Sequence Level Contrastive Learning for Text Summarization

Arxiv

14+阅读 · 2021年9月24日

iReason: Multimodal Commonsense Reasoning using Videos and Natural Language with Interpretability

Arxiv

17+阅读 · 2021年6月25日

Interpreting and Unifying Graph Neural Networks with An Optimization Framework

Arxiv

18+阅读 · 2021年1月28日

VIP会员

文章信息

相关主题

语言模型化

相关VIP内容

NeurlPS 2022 | 自然语言处理相关论文分类整理

NeurlPS 2022 | 自然语言处理相关论文分类整理

专知会员服务

51+阅读 · 2022年10月2日

最新《自监督表示学习》报告，70页ppt

最新《自监督表示学习》报告，70页ppt

专知会员服务

86+阅读 · 2020年12月22日

迁移学习简明教程，11页ppt

迁移学习简明教程，11页ppt

专知会员服务

108+阅读 · 2020年8月4日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

专知会员服务

15+阅读 · 2019年10月23日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

TensorFlow 2.0 学习资源汇总

TensorFlow 2.0 学习资源汇总

专知会员服务

67+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《小型无人机系统侦测追踪技术：声学、计算机视觉与深度学习融合方案》最新98页

《"牧羊人网格"拦截策略：实现无人机集群可靠拦截的新范式》

光纤无人机：反无人机系统的重大挑战

《作战建模与仿真实证研究》

相关资讯

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

【ICIG2021】Latest News & Announcements of the Workshop

【ICIG2021】Latest News & Announcements of the Workshop

中国图象图形学学会CSIG

0+阅读 · 2021年12月20日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

中国图象图形学学会CSIG

0+阅读 · 2021年11月8日

会议交流 | IJCKG: International Joint Conference on Knowledge Graphs

会议交流 | IJCKG: International Joint Conference on Knowledge Graphs

开放知识图谱

0+阅读 · 2021年9月9日

【ICIG2021】Latest News & Announcements of the Industry Talk1

【ICIG2021】Latest News & Announcements of the Industry Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年7月28日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

相关论文

Unsupervised Sentence Textual Similarity with Compositional Phrase Semantics

Arxiv

0+阅读 · 2022年10月5日

And what if two musical versions don't share melody, harmony, rhythm, or lyrics ?

Arxiv

0+阅读 · 2022年10月3日

BVI-VFI: A Video Quality Database for Video Frame Interpolation

Arxiv

0+阅读 · 2022年10月3日

SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model

Arxiv

0+阅读 · 2022年10月3日

PoLyScriber: Integrated Training of Extractor and Lyrics Transcriber for Polyphonic Music

Arxiv

0+阅读 · 2022年10月2日

Building for Tomorrow: Assessing the Temporal Persistence of Text Classifiers

Arxiv

0+阅读 · 2022年10月1日

Language Models Can Teach Themselves to Program Better

Arxiv

0+阅读 · 2022年9月30日

Sequence Level Contrastive Learning for Text Summarization

Sequence Level Contrastive Learning for Text Summarization

Arxiv

14+阅读 · 2021年9月24日

iReason: Multimodal Commonsense Reasoning using Videos and Natural Language with Interpretability

Arxiv

17+阅读 · 2021年6月25日

Interpreting and Unifying Graph Neural Networks with An Optimization Framework

Arxiv

18+阅读 · 2021年1月28日

相关基金

互联网与数学文化传播研讨会

国家自然科学基金

1+阅读 · 2018年9月23日

救必应黄酮类成分抗产ESBLs大肠杆菌作用机理研究

国家自然科学基金

0+阅读 · 2014年12月31日

基于代谢组学的“附子理中汤”治疗脾虚证的药效物质基础与作用机理研究

国家自然科学基金

0+阅读 · 2013年12月31日

垄断及双寡头市场条件下企业级软件交付模式的研究

国家自然科学基金

2+阅读 · 2013年12月31日

MicroRNA-10a/b靶向调控ABCA1和ABCG1对胆固醇流出的影响

国家自然科学基金

0+阅读 · 2013年12月31日

Ghrelin对牛卵母细胞体外成熟的调控机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于RAGE靶标研究生地山茱萸环烯醚萜苷类成分干预糖尿病肾病的分子机制

国家自然科学基金

0+阅读 · 2012年12月31日

汉越双语语料库建设及词对齐方法研究

国家自然科学基金

0+阅读 · 2012年12月31日

格构式高耸结构基于大位移和稳定性的抗风优化设计方法研究

国家自然科学基金

0+阅读 · 2012年12月31日

PCOS患者卵巢颗粒细胞对卵子及早期胚胎发育潜能的基因调控

国家自然科学基金

0+阅读 · 2011年12月31日

微信扫码咨询专知VIP会员