双装甲对反机器人学习 (Dual-Arm Adversarial Robot Learning)

Robot learning is a very promising topic for the future of automation and machine intelligence. Future robots should be able to autonomously acquire skills, learn to represent their environment, and interact with it. While these topics have been explored in simulation, real-world robot learning research seems to be still limited. This is due to the additional challenges encountered in the real-world, such as noisy sensors and actuators, safe exploration, non-stationary dynamics, autonomous environment resetting as well as the cost of running experiments for long periods of time. Unless we develop scalable solutions to these problems, learning complex tasks involving hand-eye coordination and rich contacts will remain an untouched vision that is only feasible in controlled lab environments. We propose dual-arm settings as platforms for robot learning. Such settings enable safe data collection for acquiring manipulation skills as well as training perception modules in a robot-supervised manner. They also ease the processes of resetting the environment. Furthermore, adversarial learning could potentially boost the generalization capability of robot learning methods by maximizing the exploration based on game-theoretic objectives while ensuring safety based on collaborative task spaces. In this paper, we will discuss the potential benefits of this setup as well as the challenges and research directions that can be pursued.

翻译：机器人学习是未来自动化和机器智能的一个非常有希望的课题。未来的机器人应该能够自主地获得技能, 学会代表环境并与之互动。虽然这些课题已经在模拟中探索过, 现实世界机器人学习研究似乎仍然有限。这是由于现实世界中遇到的额外挑战, 如噪音传感器和导体、安全探索、非静止动态、自主环境再设置以及长期运行实验的成本。除非我们制定可扩展的解决方案, 否则, 学习涉及手眼协调和丰富接触的复杂任务将仍然是一种未触及的愿景, 只有在受控制的实验室环境中才可行。我们提议将双臂设置作为机器人学习的平台。这种设置有助于安全收集数据, 以获得操作技能, 以及以机器人控制的方式培训感知模块。它们也方便了环境的再造过程。此外, 对抗性学习可能提高机器人学习方法的普及能力, 其方法是根据游戏理论目标进行最大化的探索, 同时确保在协作任务空间上的安全。在本文中, 我们将讨论这项研究可能带来哪些挑战, 作为研究的延伸。