FlapAI Bird:培训一名利用强化学习技术玩玩小鸟的代理 (FlapAI Bird: Training an Agent to Play Flappy Bird Using Reinforcement Learning Techniques)

Reinforcement learning is one of the most popular approaches for automated game playing. This method allows an agent to estimate the expected utility of its state in order to make optimal actions in an unknown environment. We seek to apply reinforcement learning algorithms to the game Flappy Bird. We implement SARSA and Q-Learning with some modifications such as $\epsilon$-greedy policy, discretization and backward updates. We find that SARSA and Q-Learning outperform the baseline, regularly achieving scores of 1400+, with the highest in-game score of 2069.

翻译：强化学习是最受欢迎的自动游戏游戏方法之一。这种方法使代理商能够估计其状态的预期效用, 以便在未知环境中采取最佳行动。我们试图将强化学习算法应用到游戏 Flappy Bird 。我们实施SASA 和 Q- Learning, 进行一些修改, 如 $\ epsilon$- greedy 政策、离散和后退更新。我们发现SASA 和 Q- Lecear 都超过了基准, 经常达到 1400 + 的分数, 最高在赛中得分为 2069 。

相关内容

Flappy Bird

关注 0

Flappy Bird （飞扬的小鸟 、 像素鸟、下坠的小鸟、笨鸟） 是一款由来自越南的独立游戏开发者Dong Nguyen所开发的作品，游戏于2013年5月24日上线，并在2014年2月突然暴红。
2014年2月，《Flappy Bird》被开发者本人从苹果及谷歌应用商店撤下。2014年8月份正式回归APP STORE，正式加入Flappy迷们期待已久的多人对战模式。游戏中玩家必须控制一只小鸟，跨越由各种不同长度水管所组成的障碍。

因果图，Causal Graphs，52页ppt

专知会员服务

250+阅读 · 2020年4月19日

深度强化学习策略梯度教程，53页ppt

专知会员服务

184+阅读 · 2020年2月1日

【论文】欺骗学习（Learning by Cheating）

专知会员服务

28+阅读 · 2020年1月3日

【强化学习资源集合】Awesome Reinforcement Learning

专知会员服务

97+阅读 · 2019年12月23日