The needs describe the necessities for a system to survive and evolve, which arouses an agent to action toward a goal, giving purpose and direction to behavior. Based on Maslow hierarchy of needs, an agent needs to satisfy a certain amount of needs at the current level as a condition to arise at the next stage -- upgrade and evolution. Especially, Deep Reinforcement Learning (DAL) can help AI agents (like robots) organize and optimize their behaviors and strategies to develop diverse Strategies based on their current state and needs (expected utilities or rewards). This paper introduces the new hierarchical needs-driven Learning systems based on DAL and investigates the implementation in the single-robot with a novel approach termed Bayesian Soft Actor-Critic (BSAC). Then, we extend this topic to the Multi-Agent systems (MAS), discussing the potential research fields and directions.
翻译:需要描述一个系统生存和演变的需要,这个系统促使人们采取行动实现一个目标,为行为提供目的和方向。根据马斯洛需求等级,一个代理需要满足当前水平的一定需求,作为下一阶段 -- -- 升级和演变 -- -- 出现的条件。特别是,深强化学习(DAL)可以帮助AI代理(像机器人一样)组织和优化其行为和战略,以便根据他们目前的状况和需求(预期的公用事业或奖励)制定不同的战略。本文介绍了以DAL为基础的新的等级驱动的学习系统,并用一种新颖的方法,即Bayesian Soft Acor-Critic (BSAC) 来调查单机器人的落实情况。然后,我们将这个专题扩大到多机构系统,讨论潜在的研究领域和方向。</s>