Meta-learning methods have been widely used in few-shot named entity recognition (NER), especially prototype-based methods. However, the Other(O) class is difficult to be represented by a prototype vector because there are generally a large number of samples in the class that have miscellaneous semantics. To solve the problem, we propose MeTNet, which generates prototype vectors for entity types only but not O-class. We design an improved triplet network to map samples and prototype vectors into a low-dimensional space that is easier to be classified and propose an adaptive margin for each entity type. The margin plays as a radius and controls a region with adaptive size in the low-dimensional space. Based on the regions, we propose a new inference procedure to predict the label of a query instance. We conduct extensive experiments in both in-domain and cross-domain settings to show the superiority of MeTNet over other state-of-the-art methods. In particular, we release a Chinese few-shot NER dataset FEW-COMM extracted from a well-known e-commerce platform. To the best of our knowledge, this is the first Chinese few-shot NER dataset. All the datasets and codes are provided at https://github.com/hccngu/MeTNet.
翻译:在点名的实体识别(NER)中广泛使用元学习方法,特别是原型方法。然而,其他(O)类很难由原型矢量代表,因为通常在类中有大量样本,具有各种语义。为了解决问题,我们提议MetNet,它只为实体类型而非O类生成原型矢量。我们设计了一个改进的三重网络,将样品和原型矢量映射到一个较易分类的低维空间,并为每个实体类型提出一个适应性差值。差值作为一个半径发挥作用,控制低维空间的适应性大小区域。我们根据区域的情况,提出一个新的推论程序,以预测查询实例的标签。我们在内部和交叉环境进行广泛的实验,以显示MetNet优于其他状态-艺术方法。我们特别从一个众所周知的电子商务平台中发布了一个中国微小的NER数据设置FEW-COMM。我们最了解的是,这是在几个数据/NERSet上提供的数据。