Learning the evolution of real-time strategy (RTS) game is a challenging problem in artificial intelligent (AI) system. In this paper, we present a novel Hebbian learning method to extract the global feature of point sets in StarCraft II game units, and its application to predict the movement of the points. Our model includes encoder, LSTM, and decoder, and we train the encoder with the unsupervised learning method. We introduce the concept of neuron activity aware learning combined with k-Winner-Takes-All. The optimal value of neuron activity is mathematically derived, and experiments support the effectiveness of the concept over the downstream task. Our Hebbian learning rule benefits the prediction with lower loss compared to self-supervised learning. Also, our model significantly saves the computational cost such as activations and FLOPs compared to a frame-based approach.
翻译:学习实时策略(RTS)游戏的进化是人工智能(AI)系统中一个具有挑战性的问题。在本文中,我们展示了一种新型的赫比亚学习方法,以提取StarCraft II游戏单元中点集的全球特征,及其用于预测点的变化。我们的模型包括编码器、 LSTM 和解码器,我们用不受监督的学习方法来培训编码器。我们引入了神经活动意识学习的概念,与 k-Winner-Takes-All 相结合。神经活动的最佳价值是从数学上推算出来的,实验也支持了该概念在下游任务上的有效性。我们的赫比亚学习规则有利于预测,与自我监督的学习相比损失较低。此外,我们的模型大大节省了计算成本,比如激活和FLOPs,而不是基于框架的方法。