Humanoids hold great potential for service, industrial, and rescue applications, in which robots must sustain whole-body stability while performing intense, contact-rich interactions with the environment. However, enabling humanoids to generate human-like, adaptive responses under such conditions remains a major challenge. To address this, we propose Thor, a humanoid framework for human-level whole-body reactions in contact-rich environments. Based on the robot's force analysis, we design a force-adaptive torso-tilt (FAT2) reward function to encourage humanoids to exhibit human-like responses during force-interaction tasks. To mitigate the high-dimensional challenges of humanoid control, Thor introduces a reinforcement learning architecture that decouples the upper body, waist, and lower body. Each component shares global observations of the whole body and jointly updates its parameters. Finally, we deploy Thor on the Unitree G1, and it substantially outperforms baselines in force-interaction tasks. Specifically, the robot achieves a peak pulling force of 167.7 N (approximately 48% of the G1's body weight) when moving backward and 145.5 N when moving forward, representing improvements of 68.9% and 74.7%, respectively, compared with the best-performing baseline. Moreover, Thor is capable of pulling a loaded rack (130 N) and opening a fire door with one hand (60 N). These results highlight Thor's effectiveness in enhancing humanoid force-interaction capabilities.
翻译:人形机器人在服务、工业和救援应用中具有巨大潜力,这些场景要求机器人在执行高强度、接触密集的环境交互时保持全身稳定性。然而,使类人机器人在此类条件下产生类人的自适应响应仍是一项重大挑战。为此,我们提出Thor,一种面向接触密集环境中人体水平全身反应的人形机器人框架。基于机器人的受力分析,我们设计了力自适应躯干倾斜(FAT2)奖励函数,以激励人形机器人在力交互任务中表现出类人响应。为缓解人形机器人控制的高维挑战,Thor引入了一种强化学习架构,将上半身、腰部和下半身解耦。各组件共享全身的全局观测并联合更新其参数。最终,我们将Thor部署于Unitree G1机器人上,其在力交互任务中显著优于基线方法。具体而言,机器人在后移时达到167.7 N的峰值拉力(约为G1体重的48%),前移时达到145.5 N,相比性能最佳的基线分别提升了68.9%和74.7%。此外,Thor能够拉动载重货架(130 N)并用单手开启防火门(60 N)。这些结果突显了Thor在增强人形机器人力交互能力方面的有效性。