Cross-task knowledge transfer via multi-task learning has recently made remarkable progress in general NLP tasks. However, entity tracking on the procedural text has not benefited from such knowledge transfer because of its distinct formulation, i.e., tracking the event flow while following structural constraints. State-of-the-art entity tracking approaches either design complicated model architectures or rely on task-specific pre-training to achieve good results. To this end, we propose MeeT, a Multi-task learning-enabled entity Tracking approach, which utilizes knowledge gained from general domain tasks to improve entity tracking. Specifically, MeeT first fine-tunes T5, a pre-trained multi-task learning model, with entity tracking-specialized QA formats, and then employs our customized decoding strategy to satisfy the structural constraints. MeeT achieves state-of-the-art performances on two popular entity tracking datasets, even though it does not require any task-specific architecture design or pre-training.
翻译:通过多任务学习进行跨部门任务知识转让,最近在一般国家劳工规划任务方面取得了显著进展,然而,对程序文本的实体跟踪没有从这种知识转让中受益,因为其制定方式不同,即跟踪事件流动,同时遵循结构性限制; 最先进的实体跟踪方法要么设计复杂的模型结构,要么依靠特定任务前培训取得良好成果; 为此,我们提议采用多任务学习驱动的实体跟踪方法,即多任务学习驱动的实体跟踪方法,利用从一般域任务中获得的知识来改进实体跟踪。 具体而言,MeT第一级微调T5,即预先培训的多任务学习模式,采用实体跟踪专业化的QA格式,然后采用我们定制的解码战略来满足结构性限制。MeT在两个受欢迎的实体跟踪数据集上实现最先进的业绩,尽管它并不需要任何特定任务的架构设计或预培训。