Deep neural networks (DNNs) have gained significant popularity in recent years, becoming the state of the art in a variety of domains. In particular, deep reinforcement learning (DRL) has recently been employed to train DNNs that act as control policies for various types of real-world systems. In this work, we present the whiRL 2.0 tool, which implements a new approach for verifying complex properties of interest for such DRL systems. To demonstrate the benefits of whiRL 2.0, we apply it to case studies from the communication networks domain that have recently been used to motivate formal verification of DRL systems, and which exhibit characteristics that are conducive to scalable verification. We propose techniques for performing k-induction and automated invariant inference on such systems, and use these techniques for proving safety and liveness properties of interest that were previously impossible to verify due to the scalability barriers of prior approaches. Furthermore, we show how our proposed techniques provide insights into the inner workings and the generalizability of DRL systems. whiRL 2.0 is publicly available online.
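As a rough illustration of the k-induction principle referred to above, the sketch below checks an invariant of a small explicit-state transition system: a base case over the first k steps from the initial states, and an inductive step asserting that any k consecutive property-satisfying states can only be followed by a property-satisfying state. This is a minimal toy, not whiRL 2.0's actual procedure (which reasons symbolically about the DNN policy); the names k_induction, successors, and prop are hypothetical.

```python
from typing import Callable, Hashable, Iterable, Set

State = Hashable

def k_induction(init: Set[State],
                successors: Callable[[State], Iterable[State]],
                states: Iterable[State],
                prop: Callable[[State], bool],
                k: int) -> bool:
    """Try to prove that `prop` is invariant via k-induction.

    Base case: every state reachable within the first k steps from an
    initial state satisfies `prop`.
    Inductive step: every path of k consecutive `prop`-satisfying states
    leads only to successors that also satisfy `prop`.
    Returns True if the proof succeeds; False if this particular k is
    insufficient (the property may still hold for a larger k).
    """
    # Base case: bounded exploration of depths 0 .. k-1.
    frontier, seen = set(init), set()
    for _ in range(k):
        if any(not prop(s) for s in frontier):
            return False
        seen |= frontier
        frontier = {t for s in frontier for t in successors(s)} - seen

    # Inductive step: enumerate all prop-satisfying paths of length k.
    paths = [(s,) for s in states if prop(s)]
    for _ in range(k - 1):
        paths = [p + (t,) for p in paths
                 for t in successors(p[-1]) if prop(t)]
    # The state following any such path must also satisfy prop.
    return all(prop(t) for p in paths for t in successors(p[-1]))

# Toy usage (hypothetical system): a counter incrementing modulo 6.
states = range(6)
succ = lambda s: [(s + 1) % 6]
assert k_induction({0}, succ, states, lambda s: s < 6, k=2)       # proven
assert not k_induction({0}, succ, states, lambda s: s != 5, k=2)  # proof fails
```

In practice the base case and inductive step are discharged by queries to a DNN verifier rather than by explicit enumeration, but the two-step structure of the argument is the same.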