Open intent classification is a challenging task in dialogue systems. On the one hand, we should ensure the classification quality of known intents. On the other hand, we need to identify the open (unknown) intent during testing. Current models are limited in finding the appropriate decision boundary to balance the performances of both known and open intents. In this paper, we propose a post-processing method to learn the adaptive decision boundary (ADB) for open intent classification. We first utilize the labeled known intent samples to pre-train the model. Then, we use the well-trained features to automatically learn the adaptive spherical decision boundaries for each known intent. Specifically, we propose a new loss function to balance both the empirical risk and the open space risk. Our method does not need open samples and is free from modifying the model architecture. We find our approach is surprisingly insensitive with less labeled data and fewer known intents. Extensive experiments on three benchmark datasets show that our method yields significant improvements compared with the state-of-the-art methods.(The code is available at https://github.com/thuiar/Adaptive-Decision-Boundary)
翻译:在对话系统中,开放意图分类是一项具有挑战性的任务。一方面,我们应该确保已知意图的分类质量。另一方面,我们需要在测试中确定开放(未知)意图的分类质量。当前模型在寻找适当的决定边界以平衡已知意图和开放意图的性能方面受到限制。在本文件中,我们提出了一个后处理方法,用于学习开放意图分类的适应性决定边界(ADB)。我们首先使用贴有标签的已知意图样品对模型进行预演。然后,我们使用经过良好训练的特性,自动了解每个已知意图的适应性球状决定界限。具体地说,我们提议一种新的损失功能,以平衡经验风险和开放空间风险。我们的方法不需要开放样品,而且不需修改模型结构。我们发现我们的方法对标签较少的数据和已知意图都不太敏感。关于三个基准数据集的广泛实验显示,我们的方法比状态-艺术方法有显著的改进。(代码见https://github.com/thuar/Adaptiment-Decion-Boundary)