为视觉复杂视频游戏使用低维多维观察过滤器进行深强化学习 (Deep Reinforcement Learning Using a Low-Dimensional Observation Filter for Visual Complex Video Game Playing)

Deep Reinforcement Learning (DRL) has produced great achievements since it was proposed, including the possibility of processing raw vision input data. However, training an agent to perform tasks based on image feedback remains a challenge. It requires the processing of large amounts of data from high-dimensional observation spaces, frame by frame, and the agent's actions are computed according to deep neural network policies, end-to-end. Image pre-processing is an effective way of reducing these high dimensional spaces, eliminating unnecessary information present in the scene, supporting the extraction of features and their representations in the agent's neural network. Modern video-games are examples of this type of challenge for DRL algorithms because of their visual complexity. In this paper, we propose a low-dimensional observation filter that allows a deep Q-network agent to successfully play in a visually complex and modern video-game, called Neon Drive.

翻译：自提出以来,深强化学习(DRL)取得了巨大成就,包括处理原始视觉输入数据的可能性。然而,培训一名代理人员执行基于图像反馈的任务仍是一项挑战。这需要处理来自高维观测空间的大量数据、框架框架和代理人员的行动根据深神经网络政策、端到端计算。图像预处理是减少这些高维空间、消除现场存在的不必要信息、支持提取特征及其在代理人员神经网络中的体现的有效方法。现代视频游戏是DRL算法因其视觉复杂性而面临这类挑战的例子。在本文中,我们提议了一个低维观测过滤器,使深Q网络代理能够在视觉复杂和现代视频游戏中成功播放,称为Neon驱动器。

相关内容

深度强化学习

关注 154

深度强化学习 (DRL) 是一种使用深度学习技术扩展传统强化学习方法的一种机器学习方法。传统强化学习方法的主要任务是使得主体根据从环境中获得的奖赏能够学习到最大化奖赏的行为。然而，传统无模型强化学习方法需要使用函数逼近技术使得主体能够学习出值函数或者策略。在这种情况下，深度学习强大的函数逼近能力自然成为了替代人工指定特征的最好手段并为性能更好的端到端学习的实现提供了可能。

不可错过！UIUC最新《统计强化学习》课程！

专知会员服务

53+阅读 · 2020年9月7日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

165+阅读 · 2020年3月18日

深度强化学习策略梯度教程，53页ppt

专知会员服务

184+阅读 · 2020年2月1日

【深度学习表格检测、信息提取和结构化】《Table Detection, Information Extraction and Structuring using Deep Learning》by Vihar Kurama

专知会员服务

38+阅读 · 2020年1月23日