多银行拆库评估的易变性 (The Fragility of Multi-Treebank Parsing Evaluation) - 专知论文

会员服务 ·

0

树库 · contrastive · 有偏 · 得分 · 论文 ·

2022 年 9 月 14 日

The Fragility of Multi-Treebank Parsing Evaluation

翻译：多银行拆库评估的易变性

Iago Alonso-Alonso,David Vilares,Carlos Gómez-Rodríguez

from arxiv, Accepted at COLING 2022

Treebank selection for parsing evaluation and the spurious effects that might arise from a biased choice have not been explored in detail. This paper studies how evaluating on a single subset of treebanks can lead to weak conclusions. First, we take a few contrasting parsers, and run them on subsets of treebanks proposed in previous work, whose use was justified (or not) on criteria such as typology or data scarcity. Second, we run a large-scale version of this experiment, create vast amounts of random subsets of treebanks, and compare on them many parsers whose scores are available. The results show substantial variability across subsets and that although establishing guidelines for good treebank selection is hard, it is possible to detect potentially harmful strategies.

翻译：用于分析评估的树库选择以及偏向选择可能产生的虚假影响尚未详细探讨。本文研究的是,对单个树库子集的评估如何会导致薄弱的结论。首先,我们采取一些对比式的采伐者,并在先前工作中提议的树库子集中运行这些分类,这些分类在类型学或数据稀缺等标准上使用是合理的(或不合理 ) 。其次,我们进行了大规模实验,创造了大量随机的树库子集,并比较了许多有分数的采伐者。研究结果显示,各子集之间差异很大,尽管为良好的树库选择制定准则是困难的,但有可能发现潜在的有害策略。

0

相关内容

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

中国图象图形学学会CSIG

0+阅读 · 2021年11月16日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

中国图象图形学学会CSIG

0+阅读 · 2021年11月10日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

Schr？dinger-Poisson方程守恒DDG方法研究

国家自然科学基金

2+阅读 · 2015年12月31日

泥石流冲击作用下桥梁结构损伤机理与计算方法研究

国家自然科学基金

1+阅读 · 2014年12月31日

水溶性SMF共聚树脂木材胶粘剂的结构、反应与固化机理

国家自然科学基金

0+阅读 · 2012年12月31日

立体多维电催化电极的制备及处理船舶压载水性能研究

国家自然科学基金

0+阅读 · 2011年12月31日

小麦族植物St、Y、P、H基因组间的演化关系研究

国家自然科学基金

0+阅读 · 2008年12月31日

The Distressing Ads That Persist: Uncovering The Harms of Targeted Weight-Loss Ads Among Users with Histories of Disordered Eating

Arxiv

0+阅读 · 2022年10月25日

Evaluation of Argo Scholar with Observational Study

Arxiv

0+阅读 · 2022年10月24日

Locality-Preserving Minimal Perfect Hashing of k-mers

Arxiv

0+阅读 · 2022年10月24日

Conditional set generation using Seq2seq models

Arxiv

0+阅读 · 2022年10月24日

Simulated redistricting plans for the analysis and evaluation of redistricting in the United States

Arxiv

0+阅读 · 2022年10月21日

VIP会员

文章信息

相关主题

相关VIP内容

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

【MIT博士论文】弱监督学习：理论、方法与应用

Andrej Karpathy：2025 年 LLM 年度回顾（2025 LLM Year in Review）

锚定情报：合成欺骗时代的地面真相

NeurIPS 2025 | NMKE：基于神经元归因与动态稀疏掩码的终身知识编辑

相关资讯

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

中国图象图形学学会CSIG

0+阅读 · 2021年11月16日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

中国图象图形学学会CSIG

0+阅读 · 2021年11月10日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

相关论文

The Distressing Ads That Persist: Uncovering The Harms of Targeted Weight-Loss Ads Among Users with Histories of Disordered Eating

Arxiv

0+阅读 · 2022年10月25日

Evaluation of Argo Scholar with Observational Study

Arxiv

0+阅读 · 2022年10月24日

Locality-Preserving Minimal Perfect Hashing of k-mers

Arxiv

0+阅读 · 2022年10月24日

Conditional set generation using Seq2seq models

Arxiv

0+阅读 · 2022年10月24日

Simulated redistricting plans for the analysis and evaluation of redistricting in the United States

Arxiv

0+阅读 · 2022年10月21日

相关基金

Schr？dinger-Poisson方程守恒DDG方法研究

国家自然科学基金

2+阅读 · 2015年12月31日

泥石流冲击作用下桥梁结构损伤机理与计算方法研究

国家自然科学基金

1+阅读 · 2014年12月31日

水溶性SMF共聚树脂木材胶粘剂的结构、反应与固化机理

国家自然科学基金

0+阅读 · 2012年12月31日

立体多维电催化电极的制备及处理船舶压载水性能研究

国家自然科学基金

0+阅读 · 2011年12月31日

小麦族植物St、Y、P、H基因组间的演化关系研究

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员