StarCraft II (SC2) is a real-time strategy game in which players produce and control multiple units to fight against opponent's units. Due to its difficulties, such as huge state space, various action space, a long time horizon, and imperfect information, SC2 has been a research hotspot in reinforcement learning. Recently, an agent called AlphaStar (AS) has been proposed, which shows good performance, obtaining a high win rate of 99.8% against human players. We implemented a mini-scaled version of it called mini-AlphaStar (mAS) based on AS's paper and pseudocode. The difference between AS and mAS is that we substituted the hyper-parameters of AS with smaller ones for mini-scale training. Codes of mAS are all open-sourced (https://github.com/liuruoze/mini-AlphaStar) for future research.
翻译:StarCraft II(SC2)是一个实时战略游戏,玩家在游戏中生产和控制多个单位来对抗对手的单位。由于它的困难,如巨大的国家空间、各种行动空间、漫长的时间跨度和信息不完善,SC2一直是强化学习的研究热点。最近,有人提议了一个名为AlphaStar(AS)的代理物,它表现良好,对人类玩家的赢率高达99.8%。我们根据AS的纸张和假代码实施了称为Mini-AlphaStar(MAS)的小型版本。AS和MAS的区别在于,我们用小型培训用较小的单子代替AS。MAS的代码都是开放的(https://github.com/liuroze/mini-AlphaStar),供未来研究使用。