Goal-oriented dialog has been given attention due to its numerous applications in artificial intelligence. Goal-oriented dialogue tasks occur when a questioner asks an action-oriented question and an answerer responds with the intent of letting the questioner know a correct action to take. To ask the adequate question, deep learning and reinforcement learning have been recently applied. However, these approaches struggle to find a competent recurrent neural questioner, owing to the complexity of learning a series of sentences. Motivated by theory of mind, we propose "Answerer in Questioner's Mind" (AQM), a novel algorithm for goal-oriented dialog. With AQM, a questioner asks and infers based on an approximated probabilistic model of the answerer. The questioner figures out the answerer's intention via selecting a plausible question by explicitly calculating the information gain of the candidate intentions and possible answers to each question. We test our framework on two goal-oriented visual dialog tasks: "MNIST Counting Dialog" and "GuessWhat?!." In our experiments, AQM outperforms comparative algorithms by a large margin.
翻译:以目标为导向的对话因其在人工智能方面的多种应用而得到关注。当询问者询问一个面向行动的问题和回答者回应时,目标导向的对话任务就会发生,目的是让询问者知道正确的行动。问一个适当的问题,最近就应用了深层次的学习和强化学习。然而,由于学习一系列句子的复杂性,这些方法努力寻找一个合格的经常性神经问答者。根据思想理论,我们提议“质疑者心智中的反射者”(AQM),这是面向目标的对话的新算法。用AQM,一个提问者根据答案者大致的概率模型提问和推断。提问者通过明确计算候选人意图和每个问题可能的答案所获得的信息,通过选择一个合理的问题来说明答案的意图。我们测试我们的框架有两个面向目标的视觉对话任务:“MNIST对对话框的计算”和“Guessaf?” 。在我们实验中,AQM用一个大幅度的比较算法。