Whole-body control of robotic manipulators with awareness of full-arm kinematics is crucial for many manipulation scenarios involving body collision avoidance or body-object interactions, which makes it insufficient to consider only the end-effector poses in policy learning. The typical approach for whole-arm manipulation is to learn actions in the robot's joint space. However, the unalignment between the joint space and actual task space (i.e., 3D space) increases the complexity of policy learning, as generalization in task space requires the policy to intrinsically understand the non-linear arm kinematics, which is difficult to learn from limited demonstrations. To address this issue, this letter proposes a kinematics-aware imitation learning framework with consistent task, observation, and action spaces, all represented in the same 3D space. Specifically, we represent both robot states and actions using a set of 3D points on the arm body, naturally aligned with the 3D point cloud observations. This spatially consistent representation improves the policy's sample efficiency and spatial generalizability while enabling full-body control. Built upon the diffusion policy, we further incorporate kinematics priors into the diffusion processes to guarantee the kinematic feasibility of output actions. The joint angle commands are finally calculated through an optimization-based whole-body inverse kinematics solver for execution. Simulation and real-world experimental results demonstrate higher success rates and stronger spatial generalizability of our approach compared to existing methods in body-aware manipulation policy learning.
翻译:具备全臂运动学感知的机器人操作器全身控制在许多涉及本体碰撞避免或本体-物体交互的操作场景中至关重要,这使得在策略学习中仅考虑末端执行器姿态变得不足。全臂操作的典型方法是在机器人关节空间中学习动作。然而,关节空间与实际任务空间(即三维空间)之间的不对齐增加了策略学习的复杂性,因为任务空间的泛化要求策略必须内在地理解非线性手臂运动学,而这难以从有限的示教数据中学习。为解决此问题,本文提出一种运动学感知的模仿学习框架,其任务空间、观测空间与动作空间均保持统一,并全部表征在相同的三维空间中。具体而言,我们使用手臂本体上的一组三维点来同时表征机器人状态与动作,该表征方式与三维点云观测天然对齐。这种空间一致的表征提升了策略的样本效率与空间泛化能力,同时实现了全身控制。基于扩散策略,我们进一步将运动学先验融入扩散过程,以保证输出动作的运动学可行性。最终通过基于优化的全身逆运动学求解器计算关节角度指令以执行操作。仿真与真实世界实验结果表明,在具备本体感知的操作策略学习中,相比现有方法,本方法具有更高的成功率和更强的空间泛化能力。