Goal-oriented dialogue has been paid attention for its numerous applications in artificial intelligence. To solve this task, deep learning and reinforcement learning have recently been applied. However, these approaches struggle to find a competent recurrent neural questioner, owing to the complexity of learning a series of sentences. Motivated by theory of mind, we propose "Answerer in Questioner's Mind" (AQM), a novel algorithm for goal-oriented dialogue. With AQM, a questioner asks and infers based on an approximated probabilistic model of the answerer. The questioner figures out the answerer's intent via selecting a plausible question by explicitly calculating the information gain of the candidate intentions and possible answers to each question. We test our framework on two goal-oriented visual dialogue tasks: "MNIST Counting Dialog" and "GuessWhat?!." In our experiments, AQM outperforms comparative algorithms and makes human-like dialogue. We further use AQM as a tool for analyzing the mechanism of deep reinforcement learning approach and discuss the future direction of practical goal-oriented neural dialogue systems.
翻译:以目标为导向的对话因其在人工智能方面的多种应用而得到关注。为了解决这个问题,最近应用了深层次的学习和强化学习。然而,由于学习一系列句子的复杂性,这些方法努力寻找一个能胜任的反复神经质问者。根据思想理论,我们提出“质疑者心智中的反射者”(AQM),这是面向目标的对话的一种新奇算法。AQM,一个提问者根据答案者大致概率模型询问和推断。提问者通过明确计算候选人意图的信息收益和每个问题的可能答案,通过选择一个合理的问题,找出答案者的意图。我们测试了我们的两个面向目标的直观对话任务框架:“MNIST对对话框的计算”和“Guess what?”。在我们实验中,AQM超越了比较算法,并进行了类似人类的对话。我们进一步使用AQM作为分析深度强化学习方法机制的工具,并讨论面向目标的神经对话系统的未来方向。