使用 $\ mathcal{V}$- 无法使用的信息来理解数据集难度 (Understanding Dataset Difficulty with $\mathcal{V}$-Usable Information) - 专知论文

会员服务 ·

0

可理解性 · 数据集 · INFORMS · MoDELS · 示例 ·

2022 年 6 月 15 日

Understanding Dataset Difficulty with $\mathcal{V}$-Usable Information

翻译：使用 $\ mathcal{V}$- 无法使用的信息来理解数据集难度

Kawin Ethayarajh,Yejin Choi,Swabha Swayamdipta

from arxiv, ICML 2022 (long talk)

Estimating the difficulty of a dataset typically involves comparing state-of-the-art models to humans; the bigger the performance gap, the harder the dataset is said to be. However, this comparison provides little understanding of how difficult each instance in a given distribution is, or what attributes make the dataset difficult for a given model. To address these questions, we frame dataset difficulty -- w.r.t. a model $\mathcal{V}$ -- as the lack of $\mathcal{V}$-$\textit{usable information}$ (Xu et al., 2019), where a lower value indicates a more difficult dataset for $\mathcal{V}$. We further introduce $\textit{pointwise $\mathcal{V}$-information}$ (PVI) for measuring the difficulty of individual instances w.r.t. a given distribution. While standard evaluation metrics typically only compare different models for the same dataset, $\mathcal{V}$-$\textit{usable information}$ and PVI also permit the converse: for a given model $\mathcal{V}$, we can compare different datasets, as well as different instances/slices of the same dataset. Furthermore, our framework allows for the interpretability of different input attributes via transformations of the input, which we use to discover annotation artefacts in widely-used NLP benchmarks.

翻译：估计一个数据集的难度通常涉及将最先进的模型与人类进行比较;性能差距越大,数据集就越难。然而,这种比较使人们对特定分布中每个实例的难度或使给定模式的数据集难于使用什么属性表示不甚理解。为了解决这些问题,我们设置了数据集难度 -- -- w.r.t. 模型$mathcal{V}{V}美元 -- -- 因为缺少美元=mathcal{V}-$\textit{可使用的基准}$(Xu et al., 2019) (Xu et al., 2019) -- -- 低值显示美元=mathcal{V} 的数据集难度越大。我们进一步引入 $\ textitit{point wixy $\mathcal{V}- info}$(PVI) 来衡量单个案例的难度。虽然标准评价指标通常只比较同一数据集的不同模型的不同模型的模型, $\math calice{$-textitrifliet flefile} $和PVI 也允许对不同的数据进行对比。

0

相关内容

可理解性

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

UC.Berkeley CS189讲义教材:《机器学习全面指南》，185页pdf

专知会员服务

162+阅读 · 2020年1月16日

【AAAI2020论文】关注实体以更好地理解文本（Attending to Entities for Better Text Understanding）

【AAAI2020论文】关注实体以更好地理解文本（Attending to Entities for Better Text Understanding）

专知会员服务

25+阅读 · 2019年11月15日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

【ICIG2021】Latest News & Announcements of the Workshop

【ICIG2021】Latest News & Announcements of the Workshop

中国图象图形学学会CSIG

0+阅读 · 2021年12月20日

【ICIG2021】Latest News & Announcements of the Plenary Talk2

【ICIG2021】Latest News & Announcements of the Plenary Talk2

中国图象图形学学会CSIG

0+阅读 · 2021年11月2日

【ICIG2021】Latest News & Announcements of the Plenary Talk1

【ICIG2021】Latest News & Announcements of the Plenary Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年11月1日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

Mg-RE-TM合金强化相中的点缺陷及其对力学性能的影响

国家自然科学基金

0+阅读 · 2014年12月31日

单相高熵合金凝固过程中固液界面特性及行为

国家自然科学基金

0+阅读 · 2014年12月31日

Calderon问题和边界刚性问题

国家自然科学基金

0+阅读 · 2013年12月31日

定向凝固钛铝合金熔体与铸型涂层界面反应微观机理研究

国家自然科学基金

0+阅读 · 2013年12月31日

Mg-Al-Ca-Sr镁合金热变形行为的位错机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

以氧化型和还原型多酸为非碘电对构筑染料敏化太阳能电池的电解质

国家自然科学基金

0+阅读 · 2012年12月31日

特种石墨制备过程中原料加压湿法高效脱杂机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

核壳结构Fe3O4/P(S-MA) 纳米材料的可控制备与脱除重金属离子的机制

国家自然科学基金

0+阅读 · 2012年12月31日

微量Zr、Mg等在Cu-Cr-Zr铜合金时效过程中的作用机理

国家自然科学基金

0+阅读 · 2011年12月31日

Mg-Ca-Sr合金的腐蚀降解及其降解产物的生物学效应

国家自然科学基金

0+阅读 · 2011年12月31日

Spectral Universality of Regularized Linear Regression with Nearly Deterministic Sensing Matrices

Arxiv

0+阅读 · 2022年8月4日

On the Learnability of Physical Concepts: Can a Neural Network Understand What's Real?

Arxiv

0+阅读 · 2022年8月4日

High stable and accurate vehicle selection scheme based on federated edge learning in vehicular networks

Arxiv

0+阅读 · 2022年8月3日

Eliciting and Learning with Soft Labels from Every Annotator

Arxiv

0+阅读 · 2022年8月2日

Robust Training under Label Noise by Over-parameterization

Arxiv

0+阅读 · 2022年8月2日

A strong call-by-need calculus

Arxiv

0+阅读 · 2022年8月2日

Understanding the classes better with class-specific and rule-specific feature selection, and redundancy control in a fuzzy rule based framework

Arxiv

0+阅读 · 2022年8月2日

ExSum: From Local Explanations to Model Understanding

Arxiv

13+阅读 · 2022年4月30日

Reasoning in Dialog: Improving Response Generation by Context Reading Comprehension

Arxiv

12+阅读 · 2020年12月14日

X-BERT: eXtreme Multi-label Text Classification with BERT

X-BERT: eXtreme Multi-label Text Classification with BERT

Arxiv

12+阅读 · 2019年7月4日

VIP会员

文章信息

相关主题

相关VIP内容

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

UC.Berkeley CS189讲义教材:《机器学习全面指南》，185页pdf

专知会员服务

162+阅读 · 2020年1月16日

【AAAI2020论文】关注实体以更好地理解文本（Attending to Entities for Better Text Understanding）

【AAAI2020论文】关注实体以更好地理解文本（Attending to Entities for Better Text Understanding）

专知会员服务

25+阅读 · 2019年11月15日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《美国太空军系统全生命周期建模、仿真与分析效能提升方案》最新84页报告

《商用大语言模型的升级风险管理：国家安全运用》

自主人工智能：未来战争是否将是自主化的？

《从装备到文化：美陆军技术素养建设启示录》最新报告

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

【ICIG2021】Latest News & Announcements of the Workshop

【ICIG2021】Latest News & Announcements of the Workshop

中国图象图形学学会CSIG

0+阅读 · 2021年12月20日

【ICIG2021】Latest News & Announcements of the Plenary Talk2

【ICIG2021】Latest News & Announcements of the Plenary Talk2

中国图象图形学学会CSIG

0+阅读 · 2021年11月2日

【ICIG2021】Latest News & Announcements of the Plenary Talk1

【ICIG2021】Latest News & Announcements of the Plenary Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年11月1日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

相关论文

Spectral Universality of Regularized Linear Regression with Nearly Deterministic Sensing Matrices

Arxiv

0+阅读 · 2022年8月4日

On the Learnability of Physical Concepts: Can a Neural Network Understand What's Real?

Arxiv

0+阅读 · 2022年8月4日

High stable and accurate vehicle selection scheme based on federated edge learning in vehicular networks

Arxiv

0+阅读 · 2022年8月3日

Eliciting and Learning with Soft Labels from Every Annotator

Arxiv

0+阅读 · 2022年8月2日

Robust Training under Label Noise by Over-parameterization

Arxiv

0+阅读 · 2022年8月2日

A strong call-by-need calculus

Arxiv

0+阅读 · 2022年8月2日

Understanding the classes better with class-specific and rule-specific feature selection, and redundancy control in a fuzzy rule based framework

Arxiv

0+阅读 · 2022年8月2日

ExSum: From Local Explanations to Model Understanding

Arxiv

13+阅读 · 2022年4月30日

Reasoning in Dialog: Improving Response Generation by Context Reading Comprehension

Arxiv

12+阅读 · 2020年12月14日

X-BERT: eXtreme Multi-label Text Classification with BERT

X-BERT: eXtreme Multi-label Text Classification with BERT

Arxiv

12+阅读 · 2019年7月4日

相关基金

Mg-RE-TM合金强化相中的点缺陷及其对力学性能的影响

国家自然科学基金

0+阅读 · 2014年12月31日

单相高熵合金凝固过程中固液界面特性及行为

国家自然科学基金

0+阅读 · 2014年12月31日

Calderon问题和边界刚性问题

国家自然科学基金

0+阅读 · 2013年12月31日

定向凝固钛铝合金熔体与铸型涂层界面反应微观机理研究

国家自然科学基金

0+阅读 · 2013年12月31日

Mg-Al-Ca-Sr镁合金热变形行为的位错机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

以氧化型和还原型多酸为非碘电对构筑染料敏化太阳能电池的电解质

国家自然科学基金

0+阅读 · 2012年12月31日

特种石墨制备过程中原料加压湿法高效脱杂机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

核壳结构Fe3O4/P(S-MA) 纳米材料的可控制备与脱除重金属离子的机制

国家自然科学基金

0+阅读 · 2012年12月31日

微量Zr、Mg等在Cu-Cr-Zr铜合金时效过程中的作用机理

国家自然科学基金

0+阅读 · 2011年12月31日

Mg-Ca-Sr合金的腐蚀降解及其降解产物的生物学效应

国家自然科学基金

0+阅读 · 2011年12月31日

微信扫码咨询专知VIP会员