Conventional Intent Detection (ID) models are usually trained offline, which relies on a fixed dataset and a predefined set of intent classes. However, in real-world applications, online systems usually involve continually emerging new user intents, which pose a great challenge to the offline training paradigm. Recently, lifelong learning has received increasing attention and is considered to be the most promising solution to this challenge. In this paper, we propose Lifelong Intent Detection (LID), which continually trains an ID model on new data to learn newly emerging intents while avoiding catastrophically forgetting old data. Nevertheless, we find that existing lifelong learning methods usually suffer from a serious imbalance between old and new data in the LID task. Therefore, we propose a novel lifelong learning method, Multi-Strategy Rebalancing (MSR), which consists of cosine normalization, hierarchical knowledge distillation, and inter-class margin loss to alleviate the multiple negative effects of the imbalance problem. Experimental results demonstrate the effectiveness of our method, which significantly outperforms previous state-of-the-art lifelong learning methods on the ATIS, SNIPS, HWU64, and CLINC150 benchmarks.
翻译:常规意图探测(ID)模型通常是在离线上培训的,这些模型依赖固定数据集和一套预先确定的意向类。然而,在现实世界应用中,在线系统通常涉及不断出现的新的用户意图,这对离线培训模式构成巨大挑战。最近,终身学习受到越来越多的关注,被认为是应对这一挑战的最有希望的解决办法。在本文中,我们提议终身意图探测(LID)模型,不断对新数据进行ID模型培训,以了解新出现的意图,同时避免灾难性地忘记旧数据。然而,我们发现,现有的终身学习方法通常在LID任务中老数据与新数据之间严重失衡。因此,我们提出了一种新的终身学习方法,即多战略平衡(MSR),其中包括共生正常化、分级知识蒸馏和阶级间差值损失,以缓解不平衡问题的多重负面影响。实验结果表明我们的方法的有效性,大大超越了ATIS、SNIPS、HWU64和CLINC1基准中以往的状态终身学习方法。