Open intent detection is a significant problem in natural language understanding, which aims to detect the unseen open intent with the prior knowledge of only known intents. Current methods have two core challenges in this task. On the one hand, they have limitations in learning friendly representations to detect the open intent. On the other hand, there lacks an effective approach to obtaining specific and compact decision boundaries for known intents. To address these issues, this paper introduces an original framework, DA-ADB, which successively learns distance-aware intent representations and adaptive decision boundaries for open intent detection. Specifically, we first leverage distance information to enhance the distinguishing capability of the intent representations. Then, we design a novel loss function to obtain appropriate decision boundaries by balancing both empirical and open space risks. Extensive experiments show the effectiveness of distance-aware and boundary learning strategies. Compared with the state-of-the-art methods, our method achieves substantial improvements on three benchmark datasets. It also yields robust performance with different proportions of labeled data and known categories. The full data and codes are available at https://github.com/thuiar/TEXTOIR
翻译:在自然语言理解方面,公开意图的探测是一个重大问题,目的是在事先知道唯一已知意图的情况下发现隐蔽的公开意图。目前的方法在这项任务中有两个核心挑战。一方面,它们学习友好的表示来发现公开意图方面有局限性。另一方面,在为已知意图获得具体和紧凑的决定界限方面缺乏有效的办法。为了解决这些问题,本文件引入了一个原始框架,即DA-ADB,它相继学习远程意识意图表示和适应性决定界限,以便公开意图探测。具体地说,我们首先利用远程信息来提高意图表示的区分能力。然后,我们设计一个新的损失功能,通过平衡实证和开放空间风险来获得适当的决定界限。广泛的实验表明远距离觉察和边界学习战略的有效性。与最新的方法相比,我们的方法在三个基准数据集上取得了重大改进。它还以不同比例的标签数据和已知类别产生强劲的性能。完整的数据和代码见https://github.com/thuar/TEXTOIR。