The challenge of information extraction (IE) lies in the diversity of label schemas and the heterogeneity of structures. Traditional methods require task-specific model design and rely heavily on expensive supervision, making them difficult to generalize to new schemas. In this paper, we decouple IE into two basic abilities, structuring and conceptualizing, which are shared by different tasks and schemas. Based on this paradigm, we propose to universally model various IE tasks with Unified Semantic Matching (USM) framework, which introduces three unified token linking operations to model the abilities of structuring and conceptualizing. In this way, USM can jointly encode schema and input text, uniformly extract substructures in parallel, and controllably decode target structures on demand. Empirical evaluation on 4 IE tasks shows that the proposed method achieves state-of-the-art performance under the supervised experiments and shows strong generalization ability in zero/few-shot transfer settings.
翻译:信息提取(IE)的挑战在于标签模式的多样性和结构的异质性。传统方法需要具体任务的模式设计,并严重依赖昂贵的监督,因此难以将其推广到新的模式。在本文中,我们将IE分为两个基本能力,即结构化和概念化,由不同任务和计划共同组成。基于这一模式,我们提议以统一的语义匹配(USM)框架普遍模拟各种IE任务,引入三个统一的标志,将操作与结构化和概念化能力的模型联系起来。这样,USM可以联合将Schema和输入文本编码,统一平行提取子结构,并控制地解码需求目标结构。对4项IE任务的经验性评估表明,在监督的实验下,拟议方法达到了最先进的性能,并在零发/发光传输环境中展示了强大的普及能力。