We present MILABOT: a deep reinforcement learning chatbot developed by the Montreal Institute for Learning Algorithms (MILA) for the Amazon Alexa Prize competition. MILABOT is capable of conversing with humans on popular small talk topics through both speech and text. The system consists of an ensemble of natural language generation and retrieval models, including neural network and template-based models. By applying reinforcement learning to crowdsourced data and real-world user interactions, the system has been trained to select an appropriate response from the models in its ensemble. The system has been evaluated through A/B testing with real-world users, where it performed significantly better than other systems. The results highlight the potential of coupling ensemble systems with deep reinforcement learning as a fruitful path for developing real-world, open-domain conversational agents.
翻译:我们介绍了MILABOT:由蒙特利尔学习算术研究所(MILA)为亚马逊亚历山大奖竞赛开发的深强化学习聊天室。MILABOT能够通过语言和文字与人交流流行的小话题。该系统由一系列自然语言生成和检索模型组成,包括神经网络和模板模型。通过对众源数据和现实世界用户互动应用强化学习,该系统接受了培训,从各种模型的组合中选择适当的反应。该系统通过A/B测试与现实世界用户进行了评估,其表现比其他系统要好得多。结果突出表明了将共同语言系统与深层强化学习相结合的潜力,这是发展现实世界、开放的交流媒介的一个富有成果的途径。