Reinforcement learning (RL) is currently used in various real-life applications. RL-based solutions have the potential to generically address problems, including the ones that are difficult to solve with heuristics and meta-heuristics and, in addition, the set of problems and issues where some intelligent or cognitive approach is required. However, reinforcement learning agents require a not straightforward design and have important design issues. RL agent design issues include the target problem modeling, state-space explosion, the training process, and agent efficiency. Research currently addresses these issues aiming to foster RL dissemination. A BAM model, in summary, allocates and shares resources with users. There are three basic BAM models and several hybrids that differ in how they allocate and share resources among users. This paper addresses the issue of an RL agent design and efficiency. The RL agent's objective is to allocate and share resources among users. The paper investigates how a BAM model can contribute to the RL agent design and efficiency. The AllocTC-Sharing (ATCS) model is analytically described and simulated to evaluate how it mimics the RL agent operation and how the ATCS can offload computational tasks from the RL agent. The essential argument researched is whether algorithms integrated with the RL agent design and operation have the potential to facilitate agent design and optimize its execution. The ATCS analytical model and simulation presented demonstrate that a BAM model offloads agent tasks and assists the agent's design and optimization.
翻译:强化学习(RL)目前用于各种现实生活中的应用中。基于RL的解决方案有可能一般性地解决各种问题,包括难以用超常和超常方法解决的问题,以及需要某种智能或认知方法的一组问题和问题。但是,强化学习代理机构需要的不是直接的设计,而是有重要的设计问题。RL的代理机构设计问题包括目标问题模型、州-空间爆炸、培训过程和代理机构效率。研究目前旨在解决这些问题以促进RL的传播。一个BAM模型,在摘要中,与用户分配和分享资源。有三个基本的BAM模型和一些混合模型,在用户之间分配和分享资源方面各不相同。本文涉及的是RL的代理机构设计和效率问题。强化学习代理机构的目标是在用户之间分配和共享资源。论文探讨了BAM模型如何有助于RL的模型设计设计和效率。AlocTC-Sharing(ATCS)模型正在进行分析描述和模拟,以评价它如何模仿RL的代理机构操作和ACTS的模拟操作和操作过程,这能帮助其BL的代理机构进行必要的设计操作。