Cloth manipulation is a challenging task due to the many degrees of freedom and properties of the material affecting the dynamics of the cloth. The nonlinear dynamics of the cloth have particularly strong significance in dynamic cloth manipulation, where some parts of the cloth are not directly controllable. In this paper, we present a novel approach for solving dynamic cloth manipulation by training policies using reinforcement learning (RL) in simulation and transferring the learned policies to the real world in a zero-shot manner. The proposed method uses visual feedback and material property randomization in a physics simulator to achieve generalization in the real world. Experimental results show that using only visual feedback is enough for the policies to learn the dynamic manipulation task in a way that transfers from simulation to the real world. In addition, the randomization of the dynamics in simulation enables capturing the behavior of a variety of cloths in the real world.
翻译:服装操纵是一项艰巨的任务,因为影响布质动态的材料具有多种程度的自由性和特性。布的非线性动态在动态布的操纵中具有特别重大的意义,因为布的某些部分无法直接控制。在本文中,我们提出了一个新颖的方法,通过培训政策解决动态布的操纵,在模拟过程中使用强化学习(RL),以零发方式将学到的政策转移到现实世界。拟议方法使用物理学模拟器的视觉反馈和材料属性随机化来实现现实世界的普及化。实验结果显示,仅使用视觉反馈就足以让政策学习动态布的操作任务,从而从模拟转移到现实世界。此外,模拟中的动态随机化还能够捕捉到真实世界中各种布体的行为。