Temporal-Spatial Causal Interpretations for Vision-Based Reinforcement Learning

Deep reinforcement learning (RL) agents are becoming increasingly proficient in a range of complex control tasks. However, because these agents rely on black-box functions, their behavior is usually difficult to interpret, which makes it hard to earn users' trust. Although several interesting interpretation methods exist for vision-based RL, most of them cannot uncover temporal causal information, which raises questions about their reliability. To address this problem, we present a temporal-spatial causal interpretation (TSCI) model for understanding the agent's long-term behavior, which is essential for sequential decision-making. The TSCI model builds on a formulation of temporal causality that reflects the temporal causal relations between the sequential observations and decisions of an RL agent. A separate causal discovery network is then employed to identify temporal-spatial causal features, which are constrained to satisfy the temporal causality. The TSCI model is applicable to recurrent agents and, once trained, discovers causal features with high efficiency. Empirical results show that the TSCI model produces high-resolution, sharp attention masks that highlight the task-relevant temporal-spatial information constituting most of the evidence for how vision-based RL agents make sequential decisions. We further demonstrate that our method provides valuable causal interpretations for vision-based RL agents from the temporal perspective.
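The abstract does not state the training objective, so the following is only a rough illustration of the general idea of scoring attention masks over a sequence of frames. It is a generic perturbation-style sketch, not the TSCI formulation of temporal causality. The names `toy_policy` and `mask_objective`, the linear stand-in agent, and the sparsity weight are all assumptions made for this example.

```python
import numpy as np

def softmax(z):
    # Numerically stable softmax over the last axis.
    z = z - z.max(axis=-1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

def toy_policy(frames, weights):
    # Hypothetical stand-in for a trained agent (assumption):
    # maps a stack of T frames of shape (T, H, W) to action probabilities.
    return softmax(frames.reshape(-1) @ weights)

def mask_objective(frames, masks, weights, sparsity=0.01):
    # Generic mask score (assumption, not the paper's loss):
    # KL divergence between the agent's decision on the original frames
    # and on the masked frames, plus a penalty on the mask area.
    p = toy_policy(frames, weights)
    q = toy_policy(frames * masks, weights)
    kl = float(np.sum(p * (np.log(p) - np.log(q))))
    return kl + sparsity * float(masks.mean())

rng = np.random.default_rng(0)
T, H, W, A = 4, 8, 8, 3                      # frames, height, width, actions
frames = rng.random((T, H, W))
weights = rng.normal(size=(T * H * W, A)) * 0.1

full_mask = np.ones((T, H, W))               # keeps every pixel in every frame
empty_mask = np.zeros((T, H, W))             # removes all evidence

loss_full = mask_objective(frames, full_mask, weights)
loss_empty = mask_objective(frames, empty_mask, weights)
print(loss_full, loss_empty)
```

A mask that keeps everything preserves the decision exactly and pays only the area penalty, while a mask that removes everything pays no area penalty but changes the decision. A learned mask generator would sit between these extremes, keeping only the pixels and time steps that the decision depends on.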
