Deep reinforcement learning, and especially the Asynchronous Advantage Actor-Critic algorithm, has been successfully used to achieve super-human performance in a variety of video games. Starcraft II is a new challenge for the reinforcement learning community with the release of pysc2 learning environment proposed by Google Deepmind and Blizzard Entertainment. Despite being a target for several AI developers, few have achieved human level performance. In this project we explain the complexities of this environment and discuss the results from our experiments on the environment. We have compared various architectures and have proved that transfer learning can be an effective paradigm in reinforcement learning research for complex scenarios requiring skill transfer.
翻译:深层强化学习,特别是Asyncronous Advantage Actor-Criztic 算法,已被成功地用于在各种视频游戏中实现超人性表演。Starcraft II是加强学习界的新挑战,谷歌Deepmind和Blizzard Internaltainment提出了释放Pyssc2学习环境的建议。尽管这是数位AI开发者的目标,但很少有人能达到人类水平的绩效。在这个项目中,我们解释了这种环境的复杂性,并讨论了我们环境实验的结果。我们比较了各种结构,并证明转移学习可以成为加强需要技能转移的复杂情景的学习研究的有效范例。