In this work a general framework is proposed to support the development of software systems that are able to adapt their behaviour according to the operating environment changes. The proposed approach, named REPTILE, works in a complete proactive manner and relies on Deep Reinforcement Learning-based agents to react to events, referred as novelties, that can affect the expected behaviour of the system. In our framework, two types of novelties are taken into account: those related to the context/environment and those related to the physical architecture itself. The framework, predicting those novelties before their occurrence, extracts time-changing models of the environment and uses a suitable Markov Decision Process to deal with the real-time setting. Moreover, the architecture of our RL agent evolves based on the possible actions that can be taken.
翻译:在这项工作中,提出一个总体框架,支持开发能够根据操作环境变化调整其行为的软件系统,拟议的方法称为REPTILE,以完全积极主动的方式运作,依靠基于深强化学习的代理机构对可能影响系统预期行为的事件作出反应,称为新颖,在我们的框架内,考虑到两种新颖性:与背景/环境有关的新颖性以及与物理结构本身有关的新颖性;在出现之前预测这些新颖性,提取时间变化的环境模型,利用适当的Markov决策程序处理实时环境;此外,我们的RL代理机构的结构根据可能采取的行动而演变。