深强化学习中异常探测简单统一框架 (A Simple Unified Framework for Anomaly Detection in Deep Reinforcement Learning)

Abnormal states in deep reinforcement learning~(RL) are states that are beyond the scope of an RL policy. Such states may make the RL system unsafe and impede its deployment in real scenarios. In this paper, we propose a simple yet effective anomaly detection framework for deep RL algorithms that simultaneously considers random, adversarial and out-of-distribution~(OOD) state outliers. In particular, we attain the class-conditional distributions for each action class under the Gaussian assumption, and rely on these distributions to discriminate between inliers and outliers based on Mahalanobis Distance~(MD) and Robust Mahalanobis Distance. We conduct extensive experiments on Atari games that verify the effectiveness of our detection strategies. To the best of our knowledge, we present the first in-detail study of statistical and adversarial anomaly detection in deep RL algorithms. This simple unified anomaly detection paves the way towards deploying safe RL systems in real-world applications.

翻译：超常状态深强化学习 ~ (RL) 是超出 RL 政策范围的状态。这样的状态可能会使 RL 系统不安全, 并阻碍其在真实情况下的部署。在本文中, 我们为深RL 算法提出一个简单而有效的异常检测框架, 同时考虑随机、对抗性和分配外 ~ (OOOD) 状态外。特别是, 我们根据 Gaussian 假设, 获得每个行动组的等级条件分布, 并依靠这些分布来区分基于 Mahalanobis 距离~ (MD) 和 Robust Mahalanobis 距离的异端点和异端点。我们在 Atari 游戏上进行了广泛的实验, 以验证我们的检测策略的有效性。我们据我们所知, 第一次在深 RL 算法中进行统计和对抗性异常现象检测。这种简单统一的异常点探测为在现实应用中部署安全 RL 系统铺平铺平了道路。

相关内容

异常检测

关注 102

在数据挖掘中，异常检测（英语：anomaly detection）对不符合预期模式或数据集中其他项目的项目、事件或观测值的识别。通常异常项目会转变成银行欺诈、结构缺陷、医疗问题、文本错误等类型的问题。异常也被称为离群值、新奇、噪声、偏差和例外。特别是在检测滥用与网络入侵时，有趣性对象往往不是罕见对象，但却是超出预料的突发活动。这种模式不遵循通常统计定义中把异常点看作是罕见对象，于是许多异常检测方法（特别是无监督的方法）将对此类数据失效，除非进行了合适的聚集。相反，聚类分析算法可能可以检测出这些模式形成的微聚类。有三大类异常检测方法。[1] 在假设数据集中大多数实例都是正常的前提下，无监督异常检测方法能通过寻找与其他数据最不匹配的实例来检测出未标记测试数据的异常。监督式异常检测方法需要一个已经被标记“正常”与“异常”的数据集，并涉及到训练分类器（与许多其他的统计分类问题的关键区别是异常检测的内在不均衡性）。半监督式异常检测方法根据一个给定的正常训练数据集创建一个表示正常行为的模型，然后检测由学习模型生成的测试实例的可能性。

基于图的异常检测，94页ppt

专知会员服务

77+阅读 · 2021年9月27日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

【牛津大学】深度残差强化学习，Deep Residual Reinforcement Learning

专知会员服务

84+阅读 · 2020年2月18日

MIT-深度学习Deep Learning State of the Art in 2020，87页ppt

专知会员服务

62+阅读 · 2020年2月17日