Frictional contact has been extensively studied as the core underlying behavior of legged locomotion and manipulation, and its nearly-discontinuous nature makes planning and control difficult even when an accurate model of the robot is available. Here, we present empirical evidence that learning an accurate model in the first place can be confounded by contact, as modern deep learning approaches are not designed to capture this non-smoothness. We isolate the effects of contact's non-smoothness by varying the mechanical stiffness of a compliant contact simulator. Even for a simple system, we find that stiffness alone dramatically degrades training processes, generalization, and data-efficiency. Our results raise serious questions about simulated testing environments which do not accurately reflect the stiffness of rigid robotic hardware. Significant additional investigation will be necessary to fully understand and mitigate these effects, and we suggest several avenues for future study.
翻译:作为腿动和操纵的核心基本行为,我们广泛研究了断流接触,因为其几乎不连续的性质使规划和控制变得困难,即使有精确的机器人模型。在这里,我们提出经验证据,证明首先学习精确模型可能会被接触所困扰,因为现代深层次的学习方法并不是为了捕捉这种非移动性而设计的。我们通过兼容的模拟接触器的机械性能将接触不运动的影响分离出来。即使对于一个简单的系统,我们发现僵化本身会极大地削弱培训过程、一般化和数据效率。我们的结果提出了模拟测试环境的严重问题,这些环境不能准确地反映僵硬的机器人硬件的僵硬性。为了充分理解和减轻这些影响,还需要进行大量的额外调查,我们建议今后研究若干途径。