Evaluating the quality of generated text is difficult, since traditional NLG evaluation metrics, which focus more on surface form than meaning, often fail to assign appropriate scores. This is especially problematic for AMR-to-text evaluation, given the abstract nature of AMR. Our work aims to support the development and improvement of NLG evaluation metrics that focus on meaning by developing a dynamic CheckList for NLG metrics that is interpretable by being organized around meaning-relevant linguistic phenomena. Each test instance consists of a pair of sentences with their AMR graphs and a human-produced textual semantic similarity or relatedness score. Our CheckList facilitates comparative evaluation of metrics and reveals the strengths and weaknesses of novel and traditional metrics. We demonstrate the usefulness of CheckList by designing a new metric, GraCo, that computes lexical cohesion graphs over AMR concepts. Our analysis suggests that GraCo is a promising NLG metric worth future investigation, and that meaning-oriented NLG metrics can profit from graph-based metric components using AMR.
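To make the two central ingredients concrete, the sketch below illustrates, in minimal form, (a) the shape of a CheckList test instance (a sentence pair, their AMR concepts as stand-ins for full AMR graphs, and a human similarity score) and (b) a toy GraCo-style score that links concepts across the two graphs and measures cross-graph lexical cohesion. This is not the paper's actual GraCo algorithm; `toy_relatedness`, the edge-density scoring, and all names here are illustrative assumptions.

```python
from dataclasses import dataclass

@dataclass
class TestInstance:
    """One CheckList item: a sentence pair, their AMR concept lists
    (stand-ins for full AMR graphs), and a human score in [0, 1]."""
    sent_a: str
    sent_b: str
    concepts_a: list  # AMR concepts extracted from sent_a's graph
    concepts_b: list  # AMR concepts extracted from sent_b's graph
    human_score: float

def toy_relatedness(u: str, v: str) -> float:
    """Placeholder relatedness: 1.0 iff the concept lemmas match after
    stripping AMR sense suffixes such as '-01'. A real metric would
    plug in embedding-based similarity here (an assumption, not the
    paper's method)."""
    strip = lambda c: c.rsplit("-", 1)[0] if c[-1].isdigit() else c
    return 1.0 if strip(u) == strip(v) else 0.0

def graco_like_score(inst: TestInstance, threshold: float = 0.5) -> float:
    """Toy cohesion-graph score: draw an edge between every cross-graph
    concept pair whose relatedness clears `threshold`, then score by
    cross-graph edge density."""
    edges = [(u, v)
             for u in inst.concepts_a
             for v in inst.concepts_b
             if toy_relatedness(u, v) >= threshold]
    denom = len(inst.concepts_a) * len(inst.concepts_b)
    return len(edges) / denom if denom else 0.0

# Hypothetical test instance; the human score is invented for illustration.
inst = TestInstance(
    sent_a="The cat sleeps.",
    sent_b="The cat rests.",
    concepts_a=["cat", "sleep-01"],
    concepts_b=["cat", "rest-01"],
    human_score=0.8,
)
print(graco_like_score(inst))  # 0.25 with the toy relatedness
```

Comparing such metric outputs against the human scores per linguistic phenomenon is what allows the CheckList to expose where a metric succeeds or fails.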