Tensegrity robots, composed of rigid rods and flexible cables, exhibit high strength-to-weight ratios and extreme deformations, enabling them to navigate unstructured terrain and even survive harsh impacts. However, they are hard to control due to their high dimensionality, complex dynamics, and coupled architecture. Physics-based simulation is one avenue for developing locomotion policies that can then be transferred to real robots, but modeling tensegrity robots is a complex task, so simulations experience a substantial sim2real gap. To address this issue, this paper describes a Real2Sim2Real strategy for tensegrity robots. This strategy is based on a differential physics engine that can be trained given limited data from a real robot (i.e. offline measurements and one random trajectory) and achieve a high enough accuracy to discover transferable locomotion policies. Beyond the overall pipeline, key contributions of this work include computing non-zero gradients at contact points, a loss function, and a trajectory segmentation technique that avoid conflicts in gradient evaluation during training. The proposed pipeline is demonstrated and evaluated on a real 3-bar tensegrity robot.
翻译:由硬棒和柔性电缆组成的灵敏机器人,其体力比和重量比高,且极端变形,使其能够在不结构的地形中航行,甚至经受严酷的冲击。然而,由于它们具有高度的维度、复杂的动态和组合结构,因此难以控制它们。物理模拟是制定移动政策的一种途径,然后可以转让给真正的机器人,但模拟时态机器人是一项复杂的任务,因此模拟会经历一个巨大的模拟差距。为了解决这个问题,本文件描述了一个Real2Sim2Real的紧张机器人战略。这个战略基于一种差异物理引擎,它可以受到来自真正的机器人的有限数据(即离线测量和一个随机轨迹)的训练,并具有足够准确性来发现可转移的移动政策。除了整个管道外,这项工作的主要贡献包括在接触点计算非零梯度梯度、损失功能以及避免在培训过程中发生冲突的轨迹分解技术。拟议的管道在真实的3压式机器人上演示和评价。