Procedural text describes dynamic state changes during a step-by-step natural process (e.g., photosynthesis). In this work, we focus on the task of procedural text understanding, which aims to comprehend such documents and track entities' states and locations during a process. Although recent approaches have achieved substantial progress, their results are far behind human performance. Two challenges, the difficulty of commonsense reasoning and data insufficiency, still remain unsolved, which require the incorporation of external knowledge bases. Previous works on external knowledge injection usually rely on noisy web mining tools and heuristic rules with limited applicable scenarios. In this paper, we propose a novel KnOwledge-Aware proceduraL text understAnding (KOALA) model, which effectively leverages multiple forms of external knowledge in this task. Specifically, we retrieve informative knowledge triples from ConceptNet and perform knowledge-aware reasoning while tracking the entities. Besides, we employ a multi-stage training schema which fine-tunes the BERT model over unlabeled data collected from Wikipedia before further fine-tuning it on the final model. Experimental results on two procedural text datasets, ProPara and Recipes, verify the effectiveness of the proposed methods, in which our model achieves state-of-the-art performance in comparison to various baselines.
翻译:程序文本描述了逐步自然过程中动态状态的变化(例如光合作用)。在这项工作中,我们侧重于程序性文本理解的任务,目的是理解这类文件,并跟踪某个过程中的实体的状态和位置。虽然最近的方法取得了实质性进展,但其结果远远落后于人类的绩效。两个挑战,即常识推理和数据不足的困难,仍未解决,这需要纳入外部知识基础。以往的外部知识注入工作通常依赖于杂乱的网络采矿工具和超常规则,且适用情景有限。在本文中,我们提出了一个新的KnOwledge-AwarduraL preceduraL 文本基础(KOALA)模型,该模型有效地利用了多种形式的外部知识。具体地说,我们从概念网络检索了信息知识知识的三倍,并在跟踪实体时进行了知识认知推理。此外,我们采用了多阶段培训计划,在对从维基百科收集的未加标签的数据进行微调之前,我们提出了一个新的KnOwledge-Aware-Aware proeduraL press press presswing (KO) (KOAL) practalal exal ress) reseralbalbal resgresulational resulation resulational-st supat supat supturgrestimets