Real-life applications, heavily relying on machine learning, such as dialog systems, demand out-of-domain detection methods. Intent classification models should be equipped with a mechanism to distinguish seen intents from unseen ones so that the dialog agent is capable of rejecting the latter and avoiding undesired behavior. However, despite increasing attention paid to the task, the best practices for out-of-domain intent detection have not yet been fully established. This paper conducts a thorough comparison of out-of-domain intent detection methods. We prioritize the methods, not requiring access to out-of-domain data during training, gathering of which is extremely time- and labor-consuming due to lexical and stylistic variation of user utterances. We evaluate multiple contextual encoders and methods, proven to be efficient, on three standard datasets for intent classification, expanded with out-of-domain utterances. Our main findings show that fine-tuning Transformer-based encoders on in-domain data leads to superior results. Mahalanobis distance, together with utterance representations, derived from Transformer-based encoders, outperforms other methods by a wide margin and establishes new state-of-the-art results for all datasets. The broader analysis shows that the reason for success lies in the fact that the fine-tuned Transformer is capable of constructing homogeneous representations of in-domain utterances, revealing geometrical disparity to out of domain utterances. In turn, the Mahalanobis distance captures this disparity easily. The code is available in our GitHub repo: https://github.com/huawei-noah/noah-research/tree/master/Maha_OOD .
翻译:真实生活应用, 严重依赖机器学习, 如对话系统、 需求外域检测方法等。 原始分类模型应该配备一个机制, 以区分可见意图和无形意图, 使对话代理器能够拒绝后者, 避免不理想行为。 然而, 尽管任务日益受到重视, 外部意图检测的最佳做法尚未完全建立。 本文对外部意图检测方法进行了彻底比较 。 我们优先考虑方法, 不需要在培训中容易获取外域数据。 培训的收集由于用户语句的词汇和时空变异而非常耗时和劳力。 我们评估多种背景编码和方法, 事实证明是有效的, 3个用于意图分类的标准数据集, 并随外域外域捕获。 我们的主要发现显示, 在内部数据上对基于变换器/ 编码进行微调, 导致结果更优。 Mahalanobis 距离, 连同从基于变换的内域内域内置码和内域域域内变换的内值数据, 以更宽的内值变变变的内程, 显示我们更深的内置的内置的内程变变的内程数据, 显示其他结果结果。 。 由以正变的内变的内变的内变的内变的内变的内变的内变式的内变的内变的内变的内变的内变法, 的内变的内变的内变的内变的内变的内变的内变的内变的内变的内变式数据结果结果, 。