The advent of deep learning (DL) gave rise to significant breakthroughs in Reinforcement Learning (RL) research. Deep Reinforcement Learning (DRL) algorithms have reached super-human level skills when applied to vision-based control problems as such in Atari 2600 games where environment states were extracted from pixel information. Unfortunately, these environments are far from being applicable to highly dynamic and complex real-world tasks as in autonomous control of a fighter aircraft since these environments only involve 2D representation of a visual world. Here, we present a semi-realistic flight simulation environment Harfang3D Dog-Fight Sandbox for fighter aircrafts. It is aimed to be a flexible toolbox for the investigation of main challenges in aviation studies using Reinforcement Learning. The program provides easy access to flight dynamics model, environment states, and aerodynamics of the plane enabling user to customize any specific task in order to build intelligent decision making (control) systems via RL. The software also allows deployment of bot aircrafts and development of multi-agent tasks. This way, multiple groups of aircrafts can be configured to be competitive or cooperative agents to perform complicated tasks including Dog Fight. During the experiments, we carried out training for two different scenarios: navigating to a designated location and within visual range (WVR) combat, shortly Dog Fight. Using Deep Reinforcement Learning techniques for both scenarios, we were able to train competent agents that exhibit human-like behaviours. Based on this results, it is confirmed that Harfang3D Dog-Fight Sandbox can be utilized as a 3D realistic RL research platform.
翻译:深层次学习(DL)的出现导致在强化学习(RL)研究方面的重大突破。深层次强化学习(DRL)算法在应用到Atari 2600游戏等基于视觉的控制问题时达到了超人水平的技能,例如,在Atari 2600游戏中,环境状态是从像素信息中提取出来的。不幸的是,这些环境远未适用于高度动态和复杂的现实世界任务,如对一架战斗机的自主控制,因为这些环境只涉及视觉世界的2D代表。在这里,我们为战斗机展示了一个半现实的飞行模拟环境 Harfang3D Dog-Fight Sandbox。它的目标是成为一个灵活的工具箱,用于调查航空研究中使用强化学习的主要挑战。这个程序为获取飞行动态模型、环境状态和飞机的空气动力提供了方便,使用户能够定制任何具体任务,以便通过RL进行智能决策(控制)系统。软件还允许部署机器人并开发多级试管任务。这个方法可以将多层飞机组合配置为竞争性或合作性机体沙箱3 。这个工具可以用来进行竞争或深层研究3级实验室 。在快速的实验室上进行复杂的飞行实验,我们可以使用一个特殊的试试探程。 。 。在这种试验中进行。