Fine-Grained Named Entity Typing (FG-NET) aims at classifying the entity mentions into a wide range of entity types (usually hundreds) depending upon the context. While distant supervision is the most common way to acquire supervised training data, it brings in label noise, as it assigns type labels to the entity mentions irrespective of mentions context. In attempts to deal with the label noise, leading research on the FG-NET assumes that the fine-grained entity typing data possesses a euclidean nature, which restraints the ability of the existing models in combating the label noise. Given the fact that the fine-grained type hierarchy exhibits a hierarchical structure, it makes hyperbolic space a natural choice to model the FG-NET data. In this research, we propose FGNET-RH, a novel framework that benefits from the hyperbolic geometry in combination with the graph structures to perform entity typing in a performance-enhanced fashion. FGNET-RH initially uses LSTM networks to encode the mention in relation with its context, later it forms a graph to distill/refine the mention encodings in the hyperbolic space. Finally, the refined mention encoding is used for entity typing. Experimentation using different benchmark datasets shows that FGNET-RH improves the performance on FG-NET by up to 3.5-% in terms of strict accuracy.
翻译:在试图处理标签噪音时,对FG-NET的领先研究假定,精选实体打字数据具有优cliidean性质,限制了现有模型在打击标签噪音方面进行分类的能力。鉴于精选类型等级结构显示等级结构,因此它使超斜体空间成为模拟FG-NET数据的一种自然选择。在这项研究中,我们提议FGNET-RH,这是一个新的框架,它与图表结构相结合,从超单体地理测量学中受益,以业绩强化的方式进行实体打字。FGNET-RH最初使用LSTM网络来记录其背景中的提及,后来它制作了一个图表,用于对FG-NET数据进行精选/检索,然后用G-RET数据的精确性能进行精密化,最后,用G-RET数据库的精确性能升级,用G-REW数据库的精确性能升级,用G-RF-RF的精确性能数据库的精确性能,最后,用G-RF-RF的精确性能升级数据库,用G-G-G-reglistalgildal 的精确数据格式进行改进。