Deformable object manipulation tasks have long been regarded as challenging robotic problems. However, until recently very little work has been done on the subject, with most robotic manipulation methods being developed for rigid objects. Deformable objects are more difficult to model and simulate, which has limited the use of model-free Reinforcement Learning (RL) strategies, due to their need for large amounts of data that can only be satisfied in simulation. This paper proposes a new shape control task for Deformable Linear Objects (DLOs). More notably, we present the first study on the effects of elastoplastic properties on this type of problem. Objects with elastoplasticity such as metal wires, are found in various applications and are challenging to manipulate due to their nonlinear behavior. We first highlight the challenges of solving such a manipulation task from an RL perspective, particularly in defining the reward. Then, based on concepts from differential geometry, we propose an intrinsic shape representation using discrete curvature and torsion. Finally, we show through an empirical study that in order to successfully solve the proposed task using Deep Deterministic Policy Gradient (DDPG), the reward needs to include intrinsic information about the shape of the DLO.
翻译:长期以来,变形物体操纵任务一直被视为具有挑战性的机器人问题。然而,直到最近为止,关于这个主题的首次研究很少,大多数机器人操纵方法都是针对僵硬物体开发的。变形物体更难建模和模拟,这限制了使用无模型的强化学习(RL)战略,因为需要大量数据才能在模拟中满足。本文建议对变形线性对象(DLOs)进行新的形状控制任务。更值得注意的是,我们介绍了关于变形线性特性对此类问题的影响的首次研究。金属线条等具有弹性的物体在各种应用中都发现,由于非线性行为,难以操作。我们首先强调从变形法角度解决这种操纵任务的挑战,特别是在确定奖赏方面。然后,根据差异几何概念,我们提出一个内在形状代表,使用离散的曲度和感光度。最后,我们通过实验性研究显示,为了成功地解决使用深度梯度政策梯度梯度(DPG)的形状(DPG)的拟议任务,奖励需要包括内在信息。