Recent years have seen a plethora of work on explaining complex intelligent agents. One example is the development of several algorithms that generate saliency maps showing how much each pixel contributed to an agent's decision. However, most evaluations of such saliency maps focus on image classification tasks. To the best of our knowledge, no prior work thoroughly compares different saliency maps for Deep Reinforcement Learning agents. This paper compares four perturbation-based approaches to creating saliency maps for Deep Reinforcement Learning agents trained on four different Atari 2600 games. All four approaches work by perturbing parts of the input and measuring how much this affects the agent's output. The approaches are compared using three computational metrics: dependence on the learned parameters of the agent (sanity checks), faithfulness to the agent's reasoning (input degradation), and run-time.
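As a rough illustration of the shared recipe behind such perturbation-based approaches (perturb part of the input, measure the change in output), the following is a minimal occlusion-style sketch. The `policy` callable, the zero-valued perturbation, and the patch size are illustrative assumptions, not any of the four specific methods compared in the paper:

```python
import numpy as np

def perturbation_saliency(policy, frame, patch=4):
    """Minimal perturbation-based saliency sketch (illustrative, not a
    specific published method): zero out square patches of the input
    frame and record how much the policy's output vector changes."""
    base = policy(frame)                      # unperturbed output
    h, w = frame.shape[:2]
    saliency = np.zeros((h, w))
    for y in range(0, h, patch):
        for x in range(0, w, patch):
            perturbed = frame.copy()
            perturbed[y:y + patch, x:x + patch] = 0.0  # occlude one patch
            # larger output change => patch mattered more to the decision
            saliency[y:y + patch, x:x + patch] = np.linalg.norm(
                policy(perturbed) - base)
    return saliency
```

The four approaches compared in the paper differ mainly in how the perturbation is constructed (e.g., what replaces the occluded region) and how the output change is aggregated into the map.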