Tensegrity robots, composed of rigid rods and flexible cables, are difficult to accurately model and control given the presence of complex dynamics and high number of DoFs. Differentiable physics engines have been recently proposed as a data-driven approach for model identification of such complex robotic systems. These engines are often executed at a high-frequency to achieve accurate simulation. Ground truth trajectories for training differentiable engines, however, are not typically available at such high frequencies due to limitations of real-world sensors. The present work focuses on this frequency mismatch, which impacts the modeling accuracy. We proposed a recurrent structure for a differentiable physics engine of tensegrity robots, which can be trained effectively even with low-frequency trajectories. To train this new recurrent engine in a robust way, this work introduces relative to prior work: (i) a new implicit integration scheme, (ii) a progressive training pipeline, and (iii) a differentiable collision checker. A model of NASA's icosahedron SUPERballBot on MuJoCo is used as the ground truth system to collect training data. Simulated experiments show that once the recurrent differentiable engine has been trained given the low-frequency trajectories from MuJoCo, it is able to match the behavior of MuJoCo's system. The criterion for success is whether a locomotion strategy learned using the differentiable engine can be transferred back to the ground-truth system and result in a similar motion. Notably, the amount of ground truth data needed to train the differentiable engine, such that the policy is transferable to the ground truth system, is 1% of the data needed to train the policy directly on the ground-truth system.
翻译:由硬棒和弹性电缆组成的感光机器人,由于存在复杂动态和大量DoFs,很难精确地建模和控制其频率错配。最近提出了不同物理引擎,作为数据驱动的方法,用于模拟这类复杂机器人系统的模型识别。这些引擎通常在高频执行,以便实现准确模拟。由于真实世界传感器的局限性,培训不同引擎的地面真实轨迹通常无法在如此高的频率上找到。目前的工作侧重于影响模型精确度的频率错配。我们建议为可区别的紧张性机器人物理学引擎建立一个经常性结构,即使低频轨迹也能对其进行有效培训。为了以稳健的方式培训这个新的重复性引擎,这项工作与以前的工作相对:(一) 一个新的隐性整合计划,(二) 渐进式培训管道,以及(三) 不同频率的碰撞检查器。美国航天局的可变电压SUPERBallBot的模型可以用作地面真相系统,用于收集数据,而常规透明透明透明性机器人的轨迹测试显示,一旦经过不同的地面系统,磁力定位系统需要不同的地面数据, 即直接地测测算。