通过多战略再平衡实现终身全心探测 (Lifelong Intent Detection via Multi-Strategy Rebalancing)

Conventional Intent Detection (ID) models are usually trained offline, which relies on a fixed dataset and a predefined set of intent classes. However, in real-world applications, online systems usually involve continually emerging new user intents, which pose a great challenge to the offline training paradigm. Recently, lifelong learning has received increasing attention and is considered to be the most promising solution to this challenge. In this paper, we propose Lifelong Intent Detection (LID), which continually trains an ID model on new data to learn newly emerging intents while avoiding catastrophically forgetting old data. Nevertheless, we find that existing lifelong learning methods usually suffer from a serious imbalance between old and new data in the LID task. Therefore, we propose a novel lifelong learning method, Multi-Strategy Rebalancing (MSR), which consists of cosine normalization, hierarchical knowledge distillation, and inter-class margin loss to alleviate the multiple negative effects of the imbalance problem. Experimental results demonstrate the effectiveness of our method, which significantly outperforms previous state-of-the-art lifelong learning methods on the ATIS, SNIPS, HWU64, and CLINC150 benchmarks.

翻译：常规意图探测(ID)模型通常是在离线上培训的,这些模型依赖固定数据集和一套预先确定的意向类。然而,在现实世界应用中,在线系统通常涉及不断出现的新的用户意图,这对离线培训模式构成巨大挑战。最近,终身学习受到越来越多的关注,被认为是应对这一挑战的最有希望的解决办法。在本文中,我们提议终身意图探测(LID)模型,不断对新数据进行ID模型培训,以了解新出现的意图,同时避免灾难性地忘记旧数据。然而,我们发现,现有的终身学习方法通常在LID任务中老数据与新数据之间严重失衡。因此,我们提出了一种新的终身学习方法,即多战略平衡(MSR),其中包括共生正常化、分级知识蒸馏和阶级间差值损失,以缓解不平衡问题的多重负面影响。实验结果表明我们的方法的有效性,大大超越了ATIS、SNIPS、HWU64和CLINC1基准中以往的状态终身学习方法。

相关内容

Continuity

关注 4

让 iOS 8 和 OS X Yosemite 无缝切换的一个新特性。 > Apple products have always been designed to work together beautifully. But now they may really surprise you. With iOS 8 and OS X Yosemite, you’ll be able to do more wonderful things than ever before.

Source: Apple - iOS 8

深度学习优化算法，73页ppt，Optimization Algorithms on Deep Learning

专知会员服务

135+阅读 · 2021年6月16日

元迁移学习的小样本学习，Meta-transfer Learning for Few-shot Learning

专知会员服务

159+阅读 · 2020年2月29日

【深度学习架构、模型和技巧集合(TensorFlow/PyTorch)】’Deep Learning Models - A collection of various deep learning architectures, models, and tips'

专知会员服务

59+阅读 · 2020年1月25日

【深度学习表格检测、信息提取和结构化】《Table Detection, Information Extraction and Structuring using Deep Learning》by Vihar Kurama

专知会员服务

38+阅读 · 2020年1月23日