In this work, we study how to build socially intelligent robots to assist people in their homes. In particular, we focus on assistance with online goal inference, where robots must simultaneously infer humans' goals and how to help them achieve those goals. Prior assistance methods either lack the adaptivity to adjust helping strategies (i.e., when and how to help) in response to uncertainty about goals or the scalability to conduct fast inference in a large goal space. Our NOPA (Neurally-guided Online Probabilistic Assistance) method addresses both of these challenges. NOPA consists of (1) an online goal inference module combining neural goal proposals with inverse planning and particle filtering for robust inference under uncertainty, and (2) a helping planner that discovers valuable subgoals to help with and is aware of the uncertainty in goal inference. We compare NOPA against multiple baselines in a new embodied AI assistance challenge: Online Watch-And-Help, in which a helper agent needs to simultaneously watch a main agent's action, infer its goal, and help perform a common household task faster in realistic virtual home environments. Experiments show that our helper agent robustly updates its goal inference and adapts its helping plans to the changing level of uncertainty.
翻译:在这项工作中,我们研究如何建设社会智能机器人以帮助人们的家园。特别是,我们侧重于在线目标推断援助,机器人必须同时推断人类的目标和如何帮助他们实现这些目标。先前的援助方法要么缺乏适应性来调整帮助战略(即,何时和如何帮助),以应对目标的不确定性或在一个巨大的目标空间中快速推断的可缩放性。我们的NOPA(Neurally-guided Online Probabilitical Access)方法解决了这两个挑战。NOPA包括:(1)一个在线目标推断模块,将神经目标提议与逆向规划和粒子过滤相结合,以便在不确定情况下进行稳健的推断。(2)一个帮助发现有价值的次级目标(即何时和如何帮助)并意识到目标推断不确定性的帮助规划者。我们比较NOPA在一个新的体现的AI援助挑战:在线监视和帮助者需要同时观看一个主要代理人的行动,即其目标,并帮助其执行共同的家庭任务,在现实的虚拟环境变化中更快地更新其目标。