用于序列标签的掩码有条件随机字段 (Masked Conditional Random Fields for Sequence Labeling)

Conditional Random Field (CRF) based neural models are among the most performant methods for solving sequence labeling problems. Despite its great success, CRF has the shortcoming of occasionally generating illegal sequences of tags, e.g. sequences containing an "I-" tag immediately after an "O" tag, which is forbidden by the underlying BIO tagging scheme. In this work, we propose Masked Conditional Random Field (MCRF), an easy to implement variant of CRF that impose restrictions on candidate paths during both training and decoding phases. We show that the proposed method thoroughly resolves this issue and brings consistent improvement over existing CRF-based models with near zero additional cost.

翻译：以有条件随机场为基础的神经模型是解决序列标签问题最有效果的方法之一。尽管它取得了巨大成功,但通用报告格式的缺点是偶尔产生非法的标签序列,例如在“O”标记之后立即含有“I”标记的序列,这是基本BIO标签办法所禁止的。在这项工作中,我们提议采用蒙面随机场(MCRF),这是易于执行的通用报告格式变量,在培训和解码阶段对候选人路径施加限制。我们表明,拟议的方法彻底解决这一问题,并以近乎零额外费用的方式持续改进基于通用报告格式的现有模型。

相关内容

条件随机场

关注 341

条件随机域（场）（conditional random fields，简称 CRF，或CRFs），是一种判别式概率模型，是随机场的一种，常用于标注或分析序列资料，如自然语言文字或是生物序列。如同马尔可夫随机场，条件随机场为具有无向的图模型，图中的顶点代表随机变量，顶点间的连线代表随机变量间的相依关系，在条件随机场中，随机变量 Y 的分布为条件机率，给定的观察值则为随机变量 X。原则上，条件随机场的图模型布局是可以任意给定的，一般常用的布局是链结式的架构，链结式架构不论在训练（training）、推论（inference）、或是解码（decoding）上，都存在效率较高的算法可供演算。

专知会员服务

30+阅读 · 2020年11月4日

【文本生成现代方法】Modern Methods for Text Generation

专知会员服务

44+阅读 · 2020年9月11日

商业数据分析，39页ppt

专知会员服务

165+阅读 · 2020年6月2日

【剑桥大学博士论文】深层神经网络结构的复兴，147页pdf，The resurgence of structure in deep neural networks

专知会员服务

20+阅读 · 2020年5月14日

【Amazon】使用预先训练的Transformer模型进行数据增强，Data Augmentation using Pre-trained Transformer Models