In many robotic tasks, such as autonomous drone racing, the goal is to travel through a set of waypoints as fast as possible. A key challenge for this task is planning the time-optimal trajectory, which is typically solved by assuming perfect knowledge of the waypoints to pass in advance. The resulting solution is either highly specialized for a single-track layout, or suboptimal due to simplifying assumptions about the platform dynamics. In this work, a new approach to near-time-optimal trajectory generation for quadrotors is presented. Leveraging deep reinforcement learning and relative gate observations, our approach can compute near-time-optimal trajectories and adapt the trajectory to environment changes. Our method exhibits computational advantages over approaches based on trajectory optimization for non-trivial track configurations. The proposed approach is evaluated on a set of race tracks in simulation and the real world, achieving speeds of up to 60 km/h with a physical quadrotor.
翻译:在许多机器人任务中,如自动无人驾驶飞行器赛车,目标是尽可能快地通过一套路径点进行旅行。任务的关键挑战是如何规划最理想的时间轨迹,通常通过假设对预通过路径点的绝对知识来解决这个问题。因此产生的解决方案要么高度专门用于单轨布局,要么由于简化了对平台动态的假设而不够理想。在这项工作中,提出了一种为二次钻探者创造近时最佳轨道的新方法。利用深度加固学习和相对门观测,我们的方法可以计算近时最佳轨迹,并适应环境变化。我们的方法展示了基于非三轨轨道配置轨迹优化的方法的计算优势。在模拟和真实世界的一组种族轨迹上对拟议方法进行了评估,与一个物理四轨迹实现高达60公里/小时的速度。