We tackle the problem of minimum-time flight for a quadrotor through a sequence of waypoints in the presence of obstacles while exploiting the full quadrotor dynamics. Early works relied on simplified dynamics or polynomial trajectory representations that did not exploit the full actuator potential of the quadrotor, and, thus, resulted in suboptimal solutions. Recent works can plan minimum-time trajectories; yet, the trajectories are executed with control methods that do not account for obstacles. Thus, a successful execution of such trajectories is prone to errors due to model mismatch and in-flight disturbances. To this end, we leverage deep reinforcement learning and classical topological path planning to train robust neural-network controllers for minimum-time quadrotor flight in cluttered environments. The resulting neural network controller demonstrates substantially better performance of up to 19\% over state-of-the-art methods. More importantly, the learned policy solves the planning and control problem simultaneously online to account for disturbances, thus achieving much higher robustness. As such, the presented method achieves 100% success rate of flying minimum-time policies without collision, while traditional planning and control approaches achieve only 40%. The proposed method is validated in both simulation and the real world, with quadrotor speeds of up to 42km/h and accelerations of 3.6g.
翻译:早期工程依靠简化的动力学或超声波轨迹图表,没有利用二次钻探器的全部振动潜能,因而导致不优化的解决办法。近期工程可以规划最短时间轨迹;然而,轨迹是用不考虑障碍的控制方法执行的。因此,由于模型不匹配和飞行干扰,成功执行这种轨迹容易出错。为此,我们利用深度加固学习和古典地貌路径规划来训练坚固的神经网络控制器,以便在封闭的环境中进行最短时间的二次钻探飞行,从而导致出现不理想的解决方案。由此产生的神经网络控制器显示,在最先进的方法上,最多可达19 ⁇ ;更重要的是,所学的政策解决了规划和控制问题,同时在网上对扰动进行核算,从而实现更大的稳健性。因此,我们提出的方法只能达到100%的神经网络控制器,同时在40度上实现最低速度的飞行速度,同时实现最低速度的飞行速度的模拟,同时实现40度的飞行速度的模拟。