In this paper, we introduce a novel deep learning based solution to the Powered-Descent Guidance (PDG) problem, grounded in principles of nonlinear Stochastic Optimal Control (SOC) and Feynman-Kac theory. Our algorithm solves the PDG problem by framing it as an $\mathcal{L}^1$ SOC problem for minimum fuel consumption. Additionally, it can handle practically useful control constraints, nonlinear dynamics and enforces state constraints as soft-constraints. This is achieved by building off of recent work on deep Forward-Backward Stochastic Differential Equations (FBSDEs) and differentiable non-convex optimization neural-network layers based on stochastic search. In contrast to previous approaches, our algorithm does not require convexification of the constraints or linearization of the dynamics and is empirically shown to be robust to stochastic disturbances and the initial position of the spacecraft. After training offline, our controller can be activated once the spacecraft is within a pre-specified radius of the landing zone and at a pre-specified altitude i.e., the base of an inverted cone with the tip at the landing zone. We demonstrate empirically that our controller can successfully and safely land all trajectories initialized at the base of this cone while minimizing fuel consumption.
翻译:在本文中,我们引入了基于非线性软骨最佳控制(SOC)和Feynman-Kac理论原则的新颖的深层次学习解决方案。我们的算法通过将PDG问题设置为用于最低燃料消耗量的$mathcal{L ⁇ 1$SOC问题来解决PDG问题。此外,它可以处理实际有用的控制限制、非线性动态以及将状态限制作为软约束。这是通过在基于Stochacistic搜索的、关于深层前方-后方托盘差异(FBSDEs)和不同、不可调控的、优化神经网络层等原则的近期工作来完成的。与以往的做法不同,我们的算法并不要求将限制或动态线性地划成对航天器的干扰和初始位置。在培训离线后,一旦航天器位于着陆区前指定的半径内,并且位于该着陆区前的一个可移动式优化的神经-网络层层层层层层层层层层层,就可以成功地展示我们所有水平的地面基础。