Proportional-integral-derivative (PID) control is the most widely used in industrial control, robot control and other fields. However, traditional PID control is not competent when the system cannot be accurately modeled and the operating environment is variable in real time. To tackle these problems, we propose a self-adaptive model-free SAC-PID control approach based on reinforcement learning for automatic control of mobile robots. A new hierarchical structure is developed, which includes the upper controller based on soft actor-critic (SAC), one of the most competitive continuous control algorithms, and the lower controller based on incremental PID controller. Soft actor-critic receives the dynamic information of the mobile robot as input, and simultaneously outputs the optimal parameters of incremental PID controllers to compensate for the error between the path and the mobile robot in real time. In addition, the combination of 24-neighborhood method and polynomial fitting is developed to improve the adaptability of SAC-PID control method to complex environments. The effectiveness of the SAC-PID control method is verified with several different difficulty paths both on Gazebo and real mecanum mobile robot. Futhermore, compared with fuzzy PID control, the SAC-PID method has merits of strong robustness, generalization and real-time performance.
翻译:在工业控制、机器人控制和其他领域最广泛使用的是成比例-整体-成比例-分层(PID)控制。然而,传统的PID控制在系统无法精确建模、操作环境在实时变化的情况下是不能胜任的。为了解决这些问题,我们提议了一种基于对移动机器人自动控制的强化学习的无自适应模型SAC-PID控制方法。正在开发一种新的等级结构,包括基于软动作-立方控法(SAC)的上层控制器(SAC),这是最具竞争力的连续控制算法之一,以及基于递增式PID控制器的下层控制器。 Soft 动作-critic 接收移动机器人的动态信息作为输入,同时输出渐进式PID控制器的最佳参数,以弥补路径与移动机器人之间在实时控制上的错误。此外,正在开发24个近邻方法和多米调配方制,以提高SAC-PID控制方法在复杂环境中的适应性。SAC-PID控制方法的有效性通过几种不同的困难路径得到验证,而Foze-ID-IM-Rizal-cal-calcal-rozze-cal 的精确控制方法则具有稳性性性性能。