Learning to Share Autonomy Across Repeated Interaction

Wheelchair-mounted robotic arms (and other assistive robots) should help their users perform everyday tasks. One way robots can provide this assistance is shared autonomy. Within shared autonomy, both the human and robot maintain control over the robot's motion: as the robot becomes confident it understands what the human wants, it increasingly intervenes to automate the task. But how does the robot know what tasks the human may want to perform in the first place? Today's shared autonomy approaches often rely on prior knowledge: for example, the robot must know the set of possible human goals a priori. In the long term, however, this prior knowledge will inevitably break down -- sooner or later the human will reach for a goal that the robot did not expect. In this paper we propose a learning approach to shared autonomy that takes advantage of repeated interactions. Learning to assist humans would be impossible if they performed completely different tasks at every interaction, but our insight is that users living with physical disabilities repeat important tasks on a daily basis (e.g., opening the fridge, making coffee, and having dinner). We introduce an algorithm that exploits these repeated interactions to recognize the human's task, replicate similar demonstrations, and return control when unsure. As the human repeatedly works with this robot, our approach continually learns to assist with tasks that were never specified beforehand: these tasks include both discrete goals (e.g., reaching a cup) and continuous skills (e.g., opening a drawer). Across simulations and an in-person user study, we demonstrate that robots leveraging our approach match existing shared autonomy methods for known goals, and outperform imitation learning baselines on new tasks. See videos here: https://youtu.be/Plh4t3wQeIA
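
To make the arbitration idea concrete, the following is a minimal sketch of confidence-based blending in the conventional setting where the goal set is known a priori. It is not the algorithm proposed in this paper, which learns tasks from repeated interaction. The function names (`goal_belief`, `blend_action`, `assist`), the softmax temperature `beta`, and the rule mapping belief to confidence are all assumptions chosen for illustration.

```python
import numpy as np


def goal_belief(position, human_action, goals, beta=5.0):
    """Softmax belief over candidate goals (illustrative assumption).

    Each goal is scored by the cosine between the human's commanded
    direction and the direction from the robot to that goal.
    """
    position = np.asarray(position, dtype=float)
    human_action = np.asarray(human_action, dtype=float)
    goals = np.asarray(goals, dtype=float)
    n = len(goals)
    speed = np.linalg.norm(human_action)
    if speed < 1e-9:
        # No human input: every goal is equally likely.
        return np.full(n, 1.0 / n)
    scores = np.empty(n)
    for i, goal in enumerate(goals):
        to_goal = goal - position
        dist = np.linalg.norm(to_goal)
        cosine = (to_goal @ human_action) / (dist * speed) if dist > 1e-9 else 1.0
        scores[i] = beta * cosine
    scores -= scores.max()  # subtract the max for numerical stability
    weights = np.exp(scores)
    return weights / weights.sum()


def blend_action(human_action, robot_action, confidence):
    """Linear blend: confidence 0 returns control to the human, 1 automates."""
    alpha = float(np.clip(confidence, 0.0, 1.0))
    human_action = np.asarray(human_action, dtype=float)
    robot_action = np.asarray(robot_action, dtype=float)
    return (1.0 - alpha) * human_action + alpha * robot_action


def assist(position, human_action, goals, beta=5.0):
    """Infer the likely goal, then blend toward it in proportion to confidence."""
    position = np.asarray(position, dtype=float)
    human_action = np.asarray(human_action, dtype=float)
    goals = np.asarray(goals, dtype=float)
    belief = goal_belief(position, human_action, goals, beta)
    n = len(goals)
    # Rescale so a uniform belief maps to 0 (human keeps full control).
    confidence = (belief.max() - 1.0 / n) / (1.0 - 1.0 / n) if n > 1 else 1.0
    to_goal = goals[int(np.argmax(belief))] - position
    dist = np.linalg.norm(to_goal)
    speed = np.linalg.norm(human_action)
    # The robot moves straight at the inferred goal at the human's commanded speed.
    robot_action = to_goal / dist * speed if dist > 1e-9 else np.zeros_like(to_goal)
    return blend_action(human_action, robot_action, confidence)
```

For example, `assist([0, 0], [1.0, 0.2], [[1, 0], [0, 1]])` pulls the command toward the first goal while keeping a small share of the human's sideways input. With zero human input the belief is uniform, the confidence is zero, and the human's command passes through unchanged, which mirrors the "return control when unsure" behavior described above.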
