We present NLU++, a novel dataset for natural language understanding (NLU) in task-oriented dialogue (ToD) systems, with the aim to provide a much more challenging evaluation environment for dialogue NLU models, up to date with the current application and industry requirements. NLU++ is divided into two domains (BANKING and HOTELS) and brings several crucial improvements over current commonly used NLU datasets. 1) NLU++ provides fine-grained domain ontologies with a large set of challenging multi-intent sentences, introducing and validating the idea of intent modules that can be combined into complex intents that convey complex user goals, combined with finer-grained and thus more challenging slot sets. 2) The ontology is divided into domain-specific and generic (i.e., domain-universal) intent modules that overlap across domains, promoting cross-domain reusability of annotated examples. 3) The dataset design has been inspired by the problems observed in industrial ToD systems, and 4) it has been collected, filtered and carefully annotated by dialogue NLU experts, yielding high-quality annotated data. Finally, we benchmark a series of current state-of-the-art NLU models on NLU++; the results demonstrate the challenging nature of the dataset, especially in low-data regimes, the validity of `intent modularisation', and call for further research on ToD NLU.
翻译:我们提出NLU++,这是在以任务为导向的对话系统中自然语言理解(NLU)的新数据集,目的是为对话NLU模式提供一个更具挑战性的评价环境,与当前应用和行业要求相适应。 NLU++分为两个领域(Banking和HOTLES),对目前常用的NLU数据集带来若干重大改进。 1 NLU++为当前通用的NLU数据集提供了细微的域域名,并配有一套具有挑战性、多重意图的句子。 引入并验证了意向模块的想法,这些模块可以合并为传递复杂的用户目标的复杂意图,加上精细的、因而更具挑战性的时档组合。 2) 主题组分为具体领域和通用(即广域通用)的意向模块,促进附加说明的示例的跨区域重复性。 3)数据集的设计受到工业多用途系统所观察到的问题的启发,4)它已经进一步收集、过滤和仔细补充了NLU专家在NL-L标准化方面进行的对话,特别是具有挑战性的当前标准性的数据系列。