We present BotSIM, a data-efficient end-to-end Bot SIMulation toolkit for commercial text-based task-oriented dialog (TOD) systems. BotSIM consists of three major components: 1) a Generator that can infer semantic-level dialog acts and entities from bot definitions and generate user queries via model-based paraphrasing; 2) an agenda-based dialog user Simulator (ABUS) to simulate conversations with the dialog agents; 3) a Remediator to analyze the simulated conversations, visualize the bot health reports and provide actionable remediation suggestions for bot troubleshooting and improvement. We demonstrate BotSIM's effectiveness in end-to-end evaluation, remediation and multi-intent dialog generation via case studies on two commercial bot platforms. BotSIM's "generation-simulation-remediation" paradigm accelerates the end-to-end bot evaluation and iteration process by: 1) reducing manual test cases creation efforts; 2) enabling a holistic gauge of the bot in terms of NLU and end-to-end performance via extensive dialog simulation; 3) improving the bot troubleshooting process with actionable suggestions. A demo of our system can be found at https://tinyurl.com/mryu74cd and a demo video at https://youtu.be/qLi5iSoly30.
翻译:我们介绍BotSIM,这是一个用于商业文本导向任务对话(TOD)系统的数据高效端对端的 BotSIM 工具包。 BotSIM 由三个主要部分组成:1) 一个能够根据机器人定义推断语系层面对话行为和实体的发电机,并通过基于模型的参数定位生成用户查询;2) 一个基于议程的对话框用户模拟器(ABUS),用于模拟与对话框代理器的对话;3) 一个用于分析模拟对话的补救器,可视化机器人健康报告,并为机器人故障排除和改进提供可操作的补救建议。我们通过两个商用机器人平台的案例研究,展示BotSIM在终端对端评价、补救和多功能层面对话生成方面的有效性。 BotSIM的“新一代模拟-模拟-补救”模式加快了终端对机器人的评价和过滤过程,具体做法是:1) 减少手工测试案例创建工作;2) 能够通过广泛的对话框模拟,对机器人U和终端-端端端-端-端-端-端-端-端-端/端-端-端-端-端-状态进行测试,我们展示演示演示展示了Bet-SIM 3SIM 改进了BAT-模-模-模-模拟系统,在A/M-模-模-SIM 测试/演示过程中发现了/演示/演示/演示/演示/演示/演示/演示过程中,改进了BOBAT/演示/演示/演示/演示/演示/演示/演示/演示/演示/演示/演示/演示/制程/制程/制程。