We have been witnessing the usefulness of conversational AI systems such as Siri and Alexa, directly impacting our daily lives. These systems normally rely on machine learning models evolving over time to provide quality user experience. However, the development and improvement of the models are challenging because they need to support both high (head) and low (tail) usage scenarios, requiring fine-grained modeling strategies for specific data subsets or slices. In this paper, we explore the recent concept of slice-based learning (SBL) (Chen et al., 2019) to improve our baseline conversational skill routing system on the tail yet critical query traffic. We first define a set of labeling functions to generate weak supervision data for the tail intents. We then extend the baseline model towards a slice-aware architecture, which monitors and improves the model performance on the selected tail intents. Applied to de-identified live traffic from a commercial conversational AI system, our experiments show that the slice-aware model is beneficial in improving model performance for the tail intents while maintaining the overall performance.
翻译:我们目睹了Siri和Alexa等对话性人工智能系统的有用性,直接影响到我们的日常生活。这些系统通常依赖随着时间而演变的机器学习模型,以提供高质量的用户经验。然而,这些模型的开发和改进具有挑战性,因为它们需要支持高(头)和低(尾)使用情景,要求为特定数据子集或切片制定细微的模型战略。在本文件中,我们探索了切片学习(SBL)(Chen等人,2019年)的最近概念,以改进我们在尾巴上但关键查询流量上的基线对话技能定线系统。我们首先界定了一套标签功能,以生成关于尾巴意图的薄弱监管数据。我们随后将基线模型扩展为切除式结构,以监测和改进选定尾巴意图的模型性能。我们应用到商业谈话性人工智能系统去辨别的现场流量,我们的实验显示,切片认知模型有助于改进尾巴意图的模型性能,同时保持总体性能。