Medical Slot Filling (MSF) task aims to convert medical queries into structured information, playing an essential role in diagnosis dialogue systems. However, the lack of sufficient term semantics learning makes existing approaches hard to capture semantically identical but colloquial expressions of terms in medical conversations. In this work, we formalize MSF into a matching problem and propose a Term Semantics Pre-trained Matching Network (TSPMN) that takes both terms and queries as input to model their semantic interaction. To learn term semantics better, we further design two self-supervised objectives, including Contrastive Term Discrimination (CTD) and Matching-based Mask Term Modeling (MMTM). CTD determines whether it is the masked term in the dialogue for each given term, while MMTM directly predicts the masked ones. Experimental results on two Chinese benchmarks show that TSPMN outperforms strong baselines, especially in few-shot settings.
翻译:医学填充(MSF)任务旨在将医疗问询转换成结构化信息,在诊断对话系统中发挥重要作用,然而,缺乏足够的术语语义学习使得现有方法难以在医疗谈话中捕捉语义相同但口述的术语表达方式。在这项工作中,我们将MSF正规化成一个匹配问题,并提议一个暂时语义学预先培训的匹配网络(TSPN),将术语和查询都作为模拟其语义互动的投入。为了更好地学习语义学,我们进一步设计了两个自我监督的目标,包括对比性术语歧视(CTD)和基于匹配的面具术语建模(MMTM)。CTD决定了它是否在对话中为每个特定术语的蒙面术语,而MMTM直接预测了蒙面的术语。两个中国基准的实验结果表明,TSPMN超越了强大的基线,特别是在几发环境中。</s>