Although deep Reinforcement Learning (RL) has proven successful in a wide range of tasks, one challenge it faces when applied to real-world problems is interpretability. Saliency maps are frequently used to provide interpretability for deep neural networks. However, in the RL domain, existing saliency map approaches are either computationally expensive, and thus unable to satisfy the real-time requirements of real-world scenarios, or fail to produce interpretable saliency maps for RL policies. In this work, we propose Distillation with selective Input Gradient Regularization (DIGR), an approach that uses policy distillation and input gradient regularization to produce new policies that achieve both high interpretability and computational efficiency in generating saliency maps. We also find that our approach improves the robustness of RL policies to multiple adversarial attacks. We conduct experiments on three tasks, MiniGrid (Fetch Object), Atari (Breakout), and CARLA autonomous driving, to demonstrate the importance and effectiveness of our approach.
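To make the combination of policy distillation and selective input gradient regularization concrete, the following is a minimal sketch of one plausible form of such an objective. It is not the paper's exact formulation: the function name `digr_loss`, the mask-based selection, the weight `beta`, and the KL direction are all illustrative assumptions.

```python
import torch
import torch.nn.functional as F

def digr_loss(student, teacher, obs, mask, beta=1.0):
    """Hypothetical sketch of a combined objective: a policy-distillation
    term (KL divergence between teacher and student action distributions)
    plus a selective input-gradient penalty on regions outside a given
    saliency mask. Exact details in the paper may differ.
    """
    obs = obs.clone().requires_grad_(True)

    # Policy distillation: match the student to the (frozen) teacher policy.
    with torch.no_grad():
        teacher_logits = teacher(obs)
    student_logits = student(obs)
    distill = F.kl_div(
        F.log_softmax(student_logits, dim=-1),
        F.softmax(teacher_logits, dim=-1),
        reduction="batchmean",
    )

    # Selective input-gradient regularization: penalize gradients of the
    # student's outputs w.r.t. input regions outside the mask
    # (mask == 1 marks task-relevant regions left unregularized).
    grads = torch.autograd.grad(student_logits.sum(), obs, create_graph=True)[0]
    grad_penalty = ((grads * (1.0 - mask)) ** 2).mean()

    return distill + beta * grad_penalty
```

Under this sketch, the saliency map of the distilled policy can be read off directly from its input gradients, which is what makes generation cheap enough for real-time use.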