We propose the first framework to learn control policies for vision-based human-to-robot handovers, a critical task for human-robot interaction. While research in Embodied AI has made significant progress in training robot agents in simulated environments, interacting with humans remains challenging due to the difficulties of simulating humans. Fortunately, recent research has developed realistic simulated environments for human-to-robot handovers. Leveraging this result, we introduce a method that is trained with a human-in-the-loop via a two-stage teacher-student framework that uses motion and grasp planning, reinforcement learning, and self-supervision. We show significant performance gains over baselines on a simulation benchmark, sim-to-sim transfer and sim-to-real transfer.
翻译:我们提出了第一个学习基于点云的人机交互接機器人技术的框架,这是人机交互中至关重要的一项任务。虽然围绕“具身化智能”的研究已经在模拟环境中取得了重大进展,但是由于人类的模拟困难,与人类交互仍然具有挑战性。幸运的是,最近的研究已经为人与机器人交互提供了逼真的模拟环境。利用这个结果,我们引入了一种方法,通过两阶段的师生框架,即运动和抓取规划、强化学习和自监督,利用人类进行有监督的训练。我们证明了在模拟基准测试、模拟到模拟转移和模拟到现实转移上相对于基线的性能显著提高。