In this paper, we introduce the Eval4NLP-2021 shared task on explainable quality estimation. Given a source-translation pair, this shared task requires not only providing a sentence-level score indicating the overall quality of the translation, but also explaining this score by identifying the words that negatively impact translation quality. We present the data, annotation guidelines, and evaluation setup of the shared task, describe the six participating systems, and analyze the results. To the best of our knowledge, this is the first shared task on explainable NLP evaluation metrics. Datasets and results are available at https://github.com/eval4nlp/SharedTask2021.