已取消关系 MDP 模型检查 (Lifted Model Checking for Relational MDPs)

Model checking has been developed for verifying the behaviour of systems with stochastic and non-deterministic behavior. It is used to provide guarantees about such systems. While most model checking methods focus on propositional models, various probabilistic planning and reinforcement frameworks deal with relational domains, for instance, STRIPS planning and relational Markov Decision Processes. Using propositional model checking in relational settings requires one to ground the model, which leads to the well known state explosion problem and intractability. We present pCTL-REBEL, a lifted model checking approach for verifying pCTL properties on relational MDPs. It extends REBEL, the relational Bellman update operator, which is a lifted value iteration approach for model-based relational reinforcement learning, toward relational model-checking. PCTL-REBEL is lifted, which means that rather than grounding, the model exploits symmetries and reasons at an abstract relational level. Theoretically, we show that the pCTL model checking approach is decidable for relational MDPs even for possibly infinite domains provided that the states have a bounded size. Practically, we contribute algorithms and an implementation of lifted relational model checking, and we show that the lifted approach improves the scalability of the model checking approach.

翻译：已经开发了用于核查具有随机和非决定性行为的系统行为的模型检查方法,用于为这些系统提供保障。虽然大多数模型检查方法侧重于建模模型,但各种概率规划和强化框架涉及关系领域,例如STRIP规划和关系Markov决定程序。在关系环境中使用建模检查需要一对一的模型,这会导致众所周知的状态爆炸问题和可受吸引性。我们提出了PCTL-REBEL,这是用于核查关系MDP中PCTL属性的解除模型检查方法。它扩展了REBEL,即关系贝尔曼更新操作器,这是基于模型的关系强化学习的提高值复制方法,用于进行关系模型检查。PCTL-REBEL被解除了,这意味着模型不是定位,而是在抽象关系层面上利用了对等和原因。理论上,我们表明,PCTL模式检查方法对于关系MDP的属性是可以辨别的,甚至无限的域。它扩展了REBEL,这是用于模型强化关系学习的增强值复制方法,用于进行关系模型检查。PCTL-REBEL-REBEL, 提供了一种升级的升级和升级的进度,我们展示了升级的升级的系统。

相关内容

MoDELS

关注 43

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【Cell】神经算法推理，Neural algorithmic reasoning

专知会员服务

29+阅读 · 2021年7月16日

【DeepMind】基于模型的强化学习，174页ppt，Model-Based Reinforcement Learning

专知会员服务

89+阅读 · 2021年1月12日

可解释强化学习，Explainable Reinforcement Learning: A Survey

专知会员服务

131+阅读 · 2020年5月14日

【基于模型的强化学习的博弈论框架】A Game Theoretic Framework for Model Based Reinforcement Learning

专知会员服务

131+阅读 · 2020年4月19日