Robot assistants for older adults and people with disabilities need to interact with their users in collaborative tasks. The core component of these systems is an interaction manager whose job is to observe and assess the task, and infer the state of the human and their intent to choose the best course of action for the robot. Due to the sparseness of the data in this domain, the policy for such multi-modal systems is often crafted by hand; as the complexity of interactions grows this process is not scalable. In this paper, we propose a reinforcement learning (RL) approach to learn the robot policy. In contrast to the dialog systems, our agent is trained with a simulator developed by using human data and can deal with multiple modalities such as language and physical actions. We conducted a human study to evaluate the performance of the system in the interaction with a user. Our designed system shows promising preliminary results when it is used by a real user.
翻译:用于老年人和残疾人的机器人助理需要与用户在协作任务中互动。 这些系统的核心组成部分是互动管理者,其职责是观察和评估任务,并推断人的状况和他们为机器人选择最佳行动方针的意图。由于该领域的数据稀少,这种多模式系统的政策往往由手工制定;由于互动的复杂性使这一过程难以扩展。在本文件中,我们建议采用强化学习(RL)方法来学习机器人政策。与对话系统不同,我们的代理者接受使用人类数据开发的模拟器的培训,并能够处理语言和物理行动等多种模式。我们进行了人类研究,以评估系统在与用户互动时的性能。我们设计的系统显示,当实际用户使用时,其初步结果很有希望。</s>