SEScore2: Retrieval Augmented Pretraining for Text Generation Evaluation - Zhuanzhi (专知) Papers


Correlation coefficient · Task-oriented dialogue systems · Unsupervised · Learning · BLEURT

December 19, 2022

SEScore2: Retrieval Augmented Pretraining for Text Generation Evaluation


Wenda Xu, Xian Qian, Mingxuan Wang, Lei Li, William Yang Wang

Is it possible to leverage large-scale raw corpora and raw parallel corpora to build a general learned metric? Existing learned metrics have gaps relative to human judgements, are model-dependent, or are limited to the domains or tasks where human ratings are available. In this paper, we propose SEScore2, a model-based metric pretrained on a million-scale synthetic dataset constructed by our novel retrieval-augmented data synthesis pipeline. SEScore2 achieves high correlation with human judgements without any human rating supervision. Importantly, our unsupervised SEScore2 can outperform supervised metrics, which are trained on News-domain human ratings, in the TED domain. We evaluate SEScore2 on four text generation tasks across three languages. SEScore2 outperforms all prior unsupervised evaluation metrics in machine translation, speech translation, data-to-text, and dialogue generation, with an average Kendall correlation improvement of 0.158. SEScore2 even outperforms the SOTA supervised metric BLEURT on data-to-text, dialogue generation, and overall correlation.
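The abstract reports results as Kendall correlation with human judgements. As a rough illustration of what that number measures, here is a minimal sketch that computes the plain Kendall tau-a between a metric's scores and human ratings. The function name `kendall_tau`, the toy scores, and the choice of the tau-a variant are assumptions made for illustration; the abstract does not say which Kendall variant the authors use, and none of the numbers below come from the paper.

```python
from itertools import combinations


def kendall_tau(metric_scores, human_ratings):
    """Kendall tau-a: (concordant - discordant) / number of pairs.

    Hypothetical helper for illustration; ties count as neither
    concordant nor discordant.
    """
    if len(metric_scores) != len(human_ratings):
        raise ValueError("score lists must have the same length")
    pairs = list(combinations(range(len(metric_scores)), 2))
    if not pairs:
        raise ValueError("need at least two scored items")

    concordant = discordant = 0
    for i, j in pairs:
        # Positive product: metric and humans order the pair the same way.
        product = (metric_scores[i] - metric_scores[j]) * (
            human_ratings[i] - human_ratings[j]
        )
        if product > 0:
            concordant += 1
        elif product < 0:
            discordant += 1
    return (concordant - discordant) / len(pairs)


# Toy data (invented): human ratings for five system outputs and the
# scores two hypothetical metrics assign to the same outputs.
human = [4.0, 2.5, 3.0, 1.0, 5.0]
metric_a = [-1.2, -6.0, -7.5, -12.0, -0.3]
metric_b = [-3.0, -2.0, -7.5, -4.0, -0.3]

tau_a = kendall_tau(metric_a, human)
tau_b = kendall_tau(metric_b, human)
print(f"metric_a tau = {tau_a:.2f}, metric_b tau = {tau_b:.2f}")
print(f"improvement  = {tau_a - tau_b:.2f}")
```

In this reading, an "average Kendall improvement" is the difference between two such correlations, averaged over the evaluated test sets. For real evaluations a library routine such as `scipy.stats.kendalltau` (which handles ties via tau-b by default) is usually the better choice.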

