AI控制的FES恢复手臂运动：使用高斯状态空间模型控制渐进性肌肉疲劳 (Towards AI-controlled FES-restoration of arm movements: Controlling for progressive muscular fatigue with Gaussian state-space models)

Reaching disability limits an individual's ability in performing daily tasks. Surface Functional Electrical Stimulation (FES) offers a non-invasive solution to restore the lost abilities. However, inducing desired movements using FES is still an open engineering problem. This problem is accentuated by the complexities of human arms' neuromechanics and the variations across individuals. Reinforcement Learning (RL) emerges as a promising approach to govern customised control rules for different subjects and settings. Yet, one remaining challenge of using RL to control FES is unobservable muscle fatigue that progressively changes as an unknown function of the stimulation, breaking the Markovian assumption of RL. In this work, we present a method to address the unobservable muscle fatigue issue, allowing our RL controller to achieve higher control performances. Our method is based on a Gaussian State-Space Model (GSSM) that utilizes recurrent neural networks to learn Markovian state-spaces from partial observations. The GSSM is used as a filter that converts the observations into the state-space representation for RL to preserve the Markovian assumption. Here, we start with presenting the modification of the original GSSM to address an overconfident issue. We then present the interaction between RL and the modified GSSM, followed by the setup for FES control learning. We test our RL-GSSM system on a planar reaching setting in simulation using a detailed neuromechanical model and show that the GSSM can help RL maintain its control performance against the fatigue.

翻译：手部活动能力的丧失限制了一个人日常任务的能力。表面功能性电刺激（FES）提供了恢复失去能力的非侵入性解决方案。然而，使用FES诱导所需的运动仍然是一个开放性的工程问题。这个问题还因人类手臂的神经机械复杂性、不同个体间的差异而变得更加困难。强化学习（RL）成为一种有前途的方法，可以为不同的受试者和设置制定定制化的控制规则。然而，使用RL控制FES仍存在一个挑战，即不可观察的肌肉疲劳会随着刺激而不断变化，破坏了RL的马尔可夫假设。在这项工作中，我们提供了一种解决不能观察肌肉疲劳问题的方法，使我们的RL控制器可以实现更高的控制性能。我们的方法基于高斯状态空间模型（GSSM），利用循环神经网络从部分观察中学习马尔可夫状态空间。GSSM被用作过滤器，将观测转换为状态空间表示，以保持马尔可夫假设。在这里，我们从介绍修改原始GSSM开始，以解决自信问题。然后介绍RL和修改后的GSSM之间的交互，并介绍FES控制学习的设置。我们在仿真中使用详细的神经机械模型对我们的RL-GSSM系统进行测试，展示了GSSM如何帮助RL对抗疲劳以保持其控制性能。