In-hand object reorientation is necessary for performing many dexterous manipulation tasks, such as tool use in unstructured environments that remain beyond the reach of current robots. Prior works built reorientation systems that assume one or many of the following specific circumstances: reorienting only specific objects with simple shapes, limited range of reorientation, slow or quasistatic manipulation, the need for specialized and costly sensor suites, simulation-only results, and other constraints which make the system infeasible for real-world deployment. We overcome these limitations and present a general object reorientation controller that is trained using reinforcement learning in simulation and evaluated in the real world. Our system uses readings from a single commodity depth camera to dynamically reorient complex objects by any amount in real time. The controller generalizes to novel objects not used during training. It is successful in the most challenging test: the ability to reorient objects in the air held by a downward-facing hand that must counteract gravity during reorientation. The results demonstrate that the policy transfer from simulation to the real world can be accomplished even for dynamic and contact-rich tasks. Lastly, our hardware only uses open-source components that cost less than five thousand dollars. Such construction makes it possible to replicate the work and democratize future research in dexterous manipulation. Videos are available at: https://taochenshh.github.io/projects/visual-dexterity.
翻译:手持物体调整是完成许多复杂操作任务所必需的,例如,在目前机器人无法进入的不结构环境中使用工具,目前机器人无法进入的非结构化环境中使用工具。先前的工程建造的调整方向系统假定了以下一种或多种具体环境:只调整形状、调整范围有限、缓慢或准静态的调整、需要专门和昂贵的传感器套件、只进行模拟的结果,以及使系统无法用于实际世界部署的其他限制。我们克服了这些限制,并提出了一个通用的物体调整控制器,该控制器在现实世界中利用强化的模拟学习和评价来进行培训。我们的系统使用单一商品深度摄像头的读数,实时地以动态方式调整复杂的物体。控制器一般用于培训期间没有使用的新物体。在最具有挑战性的测试中是成功的:在调整方向时,用下向式的手对物体进行调整的能力必须抵消重力。结果显示,即使为动态和接触丰富的任务,也可以完成从模拟到真实世界的政策转移。最后,我们的硬件只使用开放源组件,其成本小于5 000次的复制/视听操作。这样可以进行模拟研究。在将来复制。这种研究中进行。这种模拟工作是可能的。可以进行。