We propose a Proportional Integral Derivative (PID) controller-based coaching scheme to expedite reinforcement learning (RL).
翻译:我们建议采用比例综合衍生法(PID)控制器指导计划,以加快强化学习。