This paper proposes a general formulation for temporal parallelisation of dynamic programming for optimal control problems. We derive the elements and associative operators to be able to use parallel scans to solve these problems with logarithmic time complexity rather than linear time complexity. We apply this methodology to problems with finite state and control spaces, linear quadratic tracking control problems, and to a class of nonlinear control problems. The computational benefits of the parallel methods are demonstrated via numerical simulations run on a graphics processing unit.
翻译:本文为最佳控制问题提出了动态编程时间平行化的通用提法。 我们从元素和关联操作器中得出元素和关联操作器,以便能够使用平行扫描来用对数时间复杂度而不是线性时间复杂度解决这些问题。 我们将这种方法应用于有限的状态和控制空间问题、线性二次跟踪控制问题,以及非线性控制问题。 平行方法的计算效益通过图形处理单位的数值模拟来证明。