Artificial Intelligence (AI) is becoming a critical component in the defense industry, as recently demonstrated by DARPA`s AlphaDogfight Trials (ADT). ADT sought to vet the feasibility of AI algorithms capable of piloting an F-16 in simulated air-to-air combat. As a participant in ADT, Lockheed Martin`s (LM) approach combines a hierarchical architecture with maximum-entropy reinforcement learning (RL), integrates expert knowledge through reward shaping, and supports modularity of policies. This approach achieved a $2^{nd}$ place finish in the final ADT event (among eight total competitors) and defeated a graduate of the US Air Force's (USAF) F-16 Weapons Instructor Course in match play.
翻译:人工智能(AI)正在成为国防行业的一个关键组成部分,正如DARPA's AlphaDogfight Treating(ADT)最近所证明的那样。DARPA's AlphaDogfight Treating(ADT)试图审查能够模拟空对空战斗中F-16实验的AI算法的可行性。作为ADT的参与者,Lockheed Martin's(LM)的方法将等级结构与最大限度的增殖学习(RL)相结合,通过奖赏塑造整合专家知识并支持政策的模块化。 这一方法在最后ADDT活动中(共8个竞争对手中)获得了2美元,并在比赛中击败了美国空军F-16武器教官课程的毕业生。