我们需要谈谈随机分裂 (We Need to Talk About Random Splits) - 专知论文

会员服务 ·

0

协变量偏移 · 有偏 · Performer · 估计/估计量 · NLP ·

2021 年 1 月 26 日

We Need to Talk About Random Splits

翻译：我们需要谈谈随机分裂

Anders Søgaard,Sebastian Ebert,Jasmijn Bastings,Katja Filippova

from arxiv, Accepted at EACL 2021

Gorman and Bedrick (2019) argued for using random splits rather than standard splits in NLP experiments. We argue that random splits, like standard splits, lead to overly optimistic performance estimates. We can also split data in biased or adversarial ways, e.g., training on short sentences and evaluating on long ones. Biased sampling has been used in domain adaptation to simulate real-world drift; this is known as the covariate shift assumption. In NLP, however, even worst-case splits, maximizing bias, often under-estimate the error observed on new samples of in-domain data, i.e., the data that models should minimally generalize to at test time. This invalidates the covariate shift assumption. Instead of using multiple random splits, future benchmarks should ideally include multiple, independent test sets instead; if infeasible, we argue that multiple biased splits leads to more realistic performance estimates than multiple random splits.

翻译：Gorman和Bedrick (2019年) 主张在 NLP 实验中使用随机分解而不是标准分解。我们争辩说, 随机分解, 如标准分解, 会导致过度乐观的性能估计。我们还可以偏颇或对称的方式分割数据, 比如, 短刑期培训和长刑期评估。在模拟真实世界漂移的域适应中, 误差抽样被使用; 这被称为共变转换假设。但是, 在 NLP 实验中, 即使是最坏的分解, 最大偏差, 也往往低估新样本中观察到的误差, 即模型在测试时应尽量笼统化的数据。这否定了共变转换的假设。未来基准应该包括多个独立的测试组, 而不是多位随机分解; 如果不可行, 我们则认为, 多重偏差导致比多位随机拆分解更现实的性性性的工作估计。

0

相关内容

协变量偏移

协变量偏移

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

【干货书】机器学习Primer，122页pdf

【干货书】机器学习Primer，122页pdf

专知会员服务

109+阅读 · 2020年10月5日

NLP必读经典文献100篇

专知会员服务

124+阅读 · 2020年9月8日

现代机器学习技术导论，596页pdf

专知会员服务

167+阅读 · 2020年7月27日

BERT到底如何work的？A Primer in BERTology: What we know about how BERT works

BERT到底如何work的？A Primer in BERTology: What we know about how BERT works

专知会员服务

50+阅读 · 2020年2月28日

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

专知会员服务

15+阅读 · 2019年10月23日

【机器学习基础最新版】（Mathematics for Machine Learning），417页pdf

【机器学习基础最新版】（Mathematics for Machine Learning），417页pdf

专知会员服务

244+阅读 · 2019年10月21日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

【新书】Python编程基础，669页pdf

【新书】Python编程基础，669页pdf

专知会员服务

196+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

Disentangled的假设的探讨

Disentangled的假设的探讨

CreateAMind

9+阅读 · 2018年12月10日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

【推荐】自然语言处理（NLP）指南

【推荐】自然语言处理（NLP）指南

机器学习研究会

35+阅读 · 2017年11月17日

【推荐】决策树/随机森林深入解析

【推荐】决策树/随机森林深入解析

机器学习研究会

5+阅读 · 2017年9月21日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

Efficient Subsampling for Generating High-Quality Images from Conditional Generative Adversarial Networks

Arxiv

0+阅读 · 2021年3月20日

SPlit: An Optimal Method for Data Splitting

Arxiv

0+阅读 · 2021年3月19日

Controlling False Discovery Rate Using Gaussian Mirrors

Arxiv

1+阅读 · 2021年3月19日

Estimation and false discovery control for the analysis of environmental mixtures

Arxiv

0+阅读 · 2021年3月18日

Optimal transport framework for efficient prototype selection

Arxiv

0+阅读 · 2021年3月18日

Understanding Generalization in Adversarial Training via the Bias-Variance Decomposition

Arxiv

0+阅读 · 2021年3月17日

Impact of the error structure on the design and analysis of enzyme kinetic models

Arxiv

0+阅读 · 2021年3月17日

Learning to Importance Sample in Primary Sample Space

Learning to Importance Sample in Primary Sample Space

Arxiv

5+阅读 · 2018年8月23日

The Search Problem in Mixture Models

Arxiv

3+阅读 · 2018年2月24日

Multi-Task Learning with Labeled and Unlabeled Tasks

Arxiv

3+阅读 · 2017年6月8日

VIP会员

文章信息

相关主题

协变量偏移

估计/估计量

相关VIP内容

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

【干货书】机器学习Primer，122页pdf

【干货书】机器学习Primer，122页pdf

专知会员服务

109+阅读 · 2020年10月5日

NLP必读经典文献100篇

专知会员服务

124+阅读 · 2020年9月8日

现代机器学习技术导论，596页pdf

专知会员服务

167+阅读 · 2020年7月27日

BERT到底如何work的？A Primer in BERTology: What we know about how BERT works

BERT到底如何work的？A Primer in BERTology: What we know about how BERT works

专知会员服务

50+阅读 · 2020年2月28日

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

专知会员服务

15+阅读 · 2019年10月23日

【机器学习基础最新版】（Mathematics for Machine Learning），417页pdf

【机器学习基础最新版】（Mathematics for Machine Learning），417页pdf

专知会员服务

244+阅读 · 2019年10月21日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

【新书】Python编程基础，669页pdf

【新书】Python编程基础，669页pdf

专知会员服务

196+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

新书册《几何深度学习的数学基础》

中程单向攻击无人机的战略意义：俄乌战争启示

在无标注条件下适配视觉—语言模型：全面综述

面向视觉语言模型的持续学习：遗忘之外的综述与分类体系

相关资讯

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

Disentangled的假设的探讨

Disentangled的假设的探讨

CreateAMind

9+阅读 · 2018年12月10日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

【推荐】自然语言处理（NLP）指南

【推荐】自然语言处理（NLP）指南

机器学习研究会

35+阅读 · 2017年11月17日

【推荐】决策树/随机森林深入解析

【推荐】决策树/随机森林深入解析

机器学习研究会

5+阅读 · 2017年9月21日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

相关论文

Efficient Subsampling for Generating High-Quality Images from Conditional Generative Adversarial Networks

Arxiv

0+阅读 · 2021年3月20日

SPlit: An Optimal Method for Data Splitting

Arxiv

0+阅读 · 2021年3月19日

Controlling False Discovery Rate Using Gaussian Mirrors

Arxiv

1+阅读 · 2021年3月19日

Estimation and false discovery control for the analysis of environmental mixtures

Arxiv

0+阅读 · 2021年3月18日

Optimal transport framework for efficient prototype selection

Arxiv

0+阅读 · 2021年3月18日

Understanding Generalization in Adversarial Training via the Bias-Variance Decomposition

Arxiv

0+阅读 · 2021年3月17日

Impact of the error structure on the design and analysis of enzyme kinetic models

Arxiv

0+阅读 · 2021年3月17日

Learning to Importance Sample in Primary Sample Space

Learning to Importance Sample in Primary Sample Space

Arxiv

5+阅读 · 2018年8月23日

The Search Problem in Mixture Models

Arxiv

3+阅读 · 2018年2月24日

Multi-Task Learning with Labeled and Unlabeled Tasks

Arxiv

3+阅读 · 2017年6月8日

微信扫码咨询专知VIP会员