A significant challenge for the control of a robotic lower extremity rehabilitation exoskeleton is to ensure stability and robustness during programmed tasks or motions, which is crucial for the safety of the mobility-impaired user. Due to various levels of the user's disability, the human-exoskeleton interaction forces and external perturbations are unpredictable and could vary substantially and cause conventional motion controllers to behave unreliably or the robot to fall down. In this work, we propose a new, reinforcement learning-based, motion controller for a lower extremity rehabilitation exoskeleton, aiming to perform collaborative squatting exercises with efficiency, stability, and strong robustness. Unlike most existing rehabilitation exoskeletons, our exoskeleton has ankle actuation on both sagittal and front planes and is equipped with multiple foot force sensors to estimate center of pressure (CoP), an important indicator of system balance. This proposed motion controller takes advantage of the CoP information by incorporating it in the state input of the control policy network and adding it to the reward during the learning to maintain a well balanced system state during motions. In addition, we use dynamics randomization and adversary force perturbations including large human interaction forces during the training to further improve control robustness. To evaluate the effectiveness of the learning controller, we conduct numerical experiments with different settings to demonstrate its remarkable ability on controlling the exoskeleton to repetitively perform well balanced and robust squatting motions under strong perturbations and realistic human interaction forces.
翻译:在这项工作中,我们提出了一个新的强化学习型运动控制器,用于低极限康复型运动控制器,目的是以效率、稳定性和强健的强健性来进行协作性占住练习,这与大多数现有的恢复型前骨架不同,因为用户残疾程度不同,人体-外骨骼互动力量和外部扰动作用不可预测,并可能大不相同,并导致常规运动控制器行为不可靠或机器人下降。在这项工作中,我们提议一个新的强化学习型运动控制器,用于低极限康复型外骨骼,目的是以效率、稳定性和强健力来进行协作性占地操练。与大多数现有的恢复型外骨架用户不同,我们的外骨骼互动力量在平面和前翼飞机上都有脚踝动作感,并配有多种脚力感应器,以估计压力中心(CoP),这是系统平衡的重要指标。我们提议的运动控制器将它纳入控制政策网络的状态投入,并在学习的强力期间添加奖励,以保持高度稳健健的稳健性运动,包括人类运动的动态期间,我们使用机动的机动状态,以机动状态来进行弹性控制。