Continuous in-hand manipulation is an important physical interaction skill, where tactile sensing provides indispensable contact information to enable dexterous manipulation of small objects. This work proposed a framework for end-to-end policy learning with tactile feedback and sim-to-real transfer, which achieved fine in-hand manipulation that controls the pose of a thin cylindrical object, such as a long stick, to track various continuous trajectories through multiple contacts of three fingertips of a dexterous robot hand with tactile sensor arrays. We estimated the central contact position between the stick and each fingertip from the high-dimensional tactile information and showed that the learned policies achieved effective manipulation performance with the processed tactile feedback. The policies were trained with deep reinforcement learning in simulation and successfully transferred to real-world experiments, using coordinated model calibration and domain randomization. We evaluated the effectiveness of tactile information via comparative studies and validated the sim-to-real performance through real-world experiments.
翻译:持续的手内操纵是一种重要的物理交互技能,触觉感知器提供不可或缺的联系信息,以实现对小物体的熟练操纵。本研究提出了一种基于触觉反馈和模拟到实际传输的端到端策略学习框架,在这个框架下,可以通过三个指尖上的触觉感知器阵列多次接触来控制细长圆柱体物体的姿态,如长棍子等,以跟踪各种连续轨迹。我们从高维触觉信息中估计了棍子与每个指尖之间的中心接触位置,并展示了所学策略通过处理后的触觉反馈实现的有效操纵性能。通过在模拟中使用深度强化学习来训练策略,并使用协调的模型校准和域随机化将其成功地转移到实际实验中。我们通过比较研究评估了触觉信息的有效性,并通过实际实验验证了模拟到实际的性能。