We present pyRDDLGym, a Python framework for auto-generation of OpenAI Gym environments from RDDL declerative description. The discrete time step evolution of variables in RDDL is described by conditional probability functions, which fits naturally into the Gym step scheme. Furthermore, since RDDL is a lifted description, the modification and scaling up of environments to support multiple entities and different configurations becomes trivial rather than a tedious process prone to errors. We hope that pyRDDLGym will serve as a new wind in the reinforcement learning community by enabling easy and rapid development of benchmarks due to the unique expressive power of RDDL. By providing explicit access to the model in the RDDL description, pyRDDLGym can also facilitate research on hybrid approaches for learning from interaction while leveraging model knowledge. We present the design and built-in examples of pyRDDLGym, and the additions made to the RDDL language that were incorporated into the framework.
翻译:我们介绍了PyRDDDDL 解析性描述中OpenAI Gym 环境自动生成的PyRDDLDDDDL Gym Python框架。RDDL 中变量的离散时间步骤演进用有条件概率函数描述,这自然适合 Gym 步骤方案。此外,由于RDDL 是一个解除描述,因此,RDDL 的变量的离散时间步骤进化功能被描述为自然适合 Gym 步骤方案。由于RDDL 的描述,PyRDDDLGym 能够提供明确访问模型的机会,从而便利在利用模型知识的同时从互动中学习的混合方法的研究。我们介绍了PyRDDDLGym 的设计和嵌入实例,以及纳入框架的RDDL 语言的添加内容,我们希望pyRDDL Gym 能够成为加强学习界的新风。