审查、分析和设计一个全面深入强化学习框架 (Review, Analysis and Design of a Comprehensive Deep Reinforcement Learning Framework)

The integration of deep learning to reinforcement learning (RL) has enabled RL to perform efficiently in high-dimensional environments. Deep RL methods have been applied to solve many complex real-world problems in recent years. However, development of a deep RL-based system is challenging because of various issues such as the selection of a suitable deep RL algorithm, its network configuration, training time, training methods, and so on. This paper proposes a comprehensive software framework that not only plays a vital role in designing a connect-the-dots deep RL architecture but also provides a guideline to develop a realistic RL application in a short time span. We have designed and developed a deep RL-based software framework that strictly ensures flexibility, robustness, and scalability. By inheriting the proposed architecture, software managers can foresee any challenges when designing a deep RL-based system. As a result, they can expedite the design process and actively control every stage of software development, which is especially critical in agile development environments. To enforce generalization, the proposed architecture does not depend on a specific RL algorithm, a network configuration, the number of agents, or the type of agents. Using our framework, software developers can develop and integrate new RL algorithms or new types of agents, and can flexibly change network configuration or the number of agents.

翻译：深层次学习与强化学习相结合(RL)使RL能够在高维环境中高效运行。深层RL方法近年来被用于解决许多复杂的现实世界问题。然而,深层RL系统的发展具有挑战性,因为各种问题,例如选择合适的深层次RL算法、其网络配置、培训时间、培训方法等等。本文件提议了一个全面的软件框架,不仅在设计连接点深度RL结构方面发挥着关键作用,而且还为在短时间内开发现实的RL应用程序提供了指南。我们设计并开发了一个深层次的基于RL软件框架,严格确保灵活性、稳健性和可扩展性。通过继承拟议的结构,软件管理员可以在设计深层次的RL系统时预见任何挑战。因此,他们可以加快设计过程并积极控制软件开发的每个阶段,这对于发展环境尤其至关重要。为了实施一般化,拟议的结构并不依赖于具体的RL算法、网络配置、代理商数量或新型代理商或新型的动态代理商框架。使用我们的软件开发商和新版本,可以将新的代理商或新型的版本发展器。