关于 " 建设实用的NLP领导板:机器翻译案例 " 的讨论 (A Discussion on Building Practical NLP Leaderboards: The Case of Machine Translation)

Recent advances in AI and ML applications have benefited from rapid progress in NLP research. Leaderboards have emerged as a popular mechanism to track and accelerate progress in NLP through competitive model development. While this has increased interest and participation, the over-reliance on single, and accuracy-based metrics have shifted focus from other important metrics that might be equally pertinent to consider in real-world contexts. In this paper, we offer a preliminary discussion of the risks associated with focusing exclusively on accuracy metrics and draw on recent discussions to highlight prescriptive suggestions on how to develop more practical and effective leaderboards that can better reflect the real-world utility of models.

翻译：AI和ML应用的最近进展得益于国家劳工政策研究的迅速进展,领头板已成为通过竞争性模式开发来跟踪和加速国家劳工政策进展的流行机制,虽然这提高了人们的兴趣和参与程度,但过度依赖单一和基于准确度的衡量标准已经从在现实世界中可能同样相关的其他重要衡量标准转移了重点,在本文中,我们初步讨论了专门侧重于准确度量度指标的相关风险,并借鉴了最近的讨论,着重说明了关于如何制定更实际、更有效的、能够更好地反映模型在现实世界中的效用的规范性建议。

相关内容

Machine Translation

关注 209

机器翻译（Machine Translation）涵盖计算语言学和语言工程的所有分支，包含多语言方面。特色论文涵盖理论，描述或计算方面的任何下列主题:双语和多语语料库的编写和使用，计算机辅助语言教学，非罗马字符集的计算含义，连接主义翻译方法，对比语言学等。官网地址：http://dblp.uni-trier.de/db/journals/mt/

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

Python分布式计算，171页pdf，Distributed Computing with Python

专知会员服务

108+阅读 · 2020年5月3日

【NLP模型压缩方法综述】《A Survey of Methods for Model Compression in NLP》by Madison May

专知会员服务

43+阅读 · 2020年4月22日

【论文】多语言神经机器翻译综述（A Comprehensive Survey of Multilingual Neural Machine Translation）

专知会员服务

20+阅读 · 2020年1月7日