We compare two orthogonal semi-supervised learning techniques, namely tri-training and pretrained word embeddings, in the task of dependency parsing. We explore language-specific FastText and ELMo embeddings and multilingual BERT embeddings. We focus on a low-resource scenario as semi-supervised learning can be expected to have the most impact here. Based on treebank size and available ELMo models, we select Hungarian, Uyghur (a zero-shot language for mBERT) and Vietnamese. Furthermore, we include English in a simulated low-resource setting. We find that pretrained word embeddings make more effective use of unlabelled data than tri-training, but that the two approaches can be successfully combined.