Zero-shot intent detection (ZSID) aims to deal with the continuously emerging intents without annotated training data. However, existing ZSID systems suffer from two limitations: 1) They are not good at modeling the relationship between seen and unseen intents, when the label names are given in the form of raw phrases or sentences. 2) They cannot effectively recognize unseen intents under the generalized intent detection (GZSID) setting. A critical factor behind these limitations is the representations of unseen intents, which cannot be learned in the training stage. To address this problem, we propose a class-transductive framework that utilizes unseen class labels to learn Disentangled Intent Representations (DIR). Specifically, we allow the model to predict unseen intents in the training stage, with the corresponding label names serving as input utterances. Under this framework, we introduce a multi-task learning objective, which encourages the model to learn the distinctions among intents, and a similarity scorer, which estimates the connections among intents more accurately based on the learned intent representations. Since the purpose of DIR is to provide better intent representations, it can be easily integrated with existing ZSID and GZSID methods. Experiments on two real-world datasets show that the proposed framework brings consistent improvement to the baseline systems, regardless of the model architectures or zero-shot learning strategies.
翻译:零射意图探测(ZSID)的目的是在没有附加说明的培训数据的情况下处理不断出现的意图,然而,现有的ZSID系统存在两个限制:(1) 当标签名称以原始短语或句子的形式提供时,它们并不善于模拟所见意图和不可见意图之间的关系;(2) 它们无法有效识别普遍意图探测(GZSID)设置下的无形意图; 这些限制背后的一个关键因素是隐蔽意图的表示,这些意图在培训阶段是无法学到的; 为了解决这一问题,我们提议了一个班级过渡框架,利用隐蔽的类标签学习分解的内在表示(DIR) 。具体地说,我们允许该模型在培训阶段预测所见意图和不可见意图之间的关系,而相应的标签名称则作为投入语句。在这个框架内,我们引入了一个多重任务学习目标,鼓励模型了解意图的区别,以及一个类似性评分,根据所学的意向陈述,更准确地估计各种意图之间的联系。由于DIR的目的是要提供更好的意图显示分解的内在的表示(DIR)说明,因此,我们允许模型在培训阶段预测看不见的无形意图,可以轻松地显示现有的实验系统。