Towards the grand challenge of achieving human-level manipulation in robots, playing piano is a compelling testbed that requires strategic, precise, and flowing movements. Over the years, several works demonstrated hand-designed controllers on real world piano playing, while other works evaluated robot learning approaches on simulated piano scenarios. In this paper, we develop the first piano playing robotic system that makes use of learning approaches while also being deployed on a real world dexterous robot. Specifically, we make use of Sim2Real to train a policy in simulation using reinforcement learning before deploying the learned policy on a real world dexterous robot. In our experiments, we thoroughly evaluate the interplay between domain randomization and the accuracy of the dynamics model used in simulation. Moreover, we evaluate the robot's performance across multiple songs with varying complexity to study the generalization of our learned policy. By providing a proof-of-concept of learning to play piano in the real world, we want to encourage the community to adopt piano playing as a compelling benchmark towards human-level manipulation. We open-source our code and show additional videos at https://lasr.org/research/learning-to-play-piano .
翻译:为实现机器人达到人类水平操作能力这一重大挑战,弹奏钢琴作为一个引人注目的测试平台,要求机器人具备策略性、精确且流畅的动作。多年来,已有若干研究展示了在真实钢琴演奏中采用手工设计控制器的工作,而另一些研究则在模拟钢琴场景中评估了机器人学习方法。本文开发了首个利用学习方法并部署于真实灵巧机器人的钢琴演奏机器人系统。具体而言,我们采用Sim2Real方法,先在仿真环境中通过强化学习训练策略,再将习得的策略部署于真实灵巧机器人。实验中,我们系统评估了领域随机化与仿真所用动力学模型精度之间的相互作用。此外,我们通过测试机器人在不同复杂度乐曲上的表现,研究了所学策略的泛化能力。通过提供在现实世界中学习弹奏钢琴的概念验证,我们希望推动学界将钢琴演奏作为实现人类水平操作能力的重要基准。我们在https://lasr.org/research/learning-to-play-piano开源了代码并提供了补充视频。