Intelligent Traffic Light Control System (ITLCS) is a typical Multi-Agent System (MAS), which comprises multiple roads and traffic lights.Constructing a model of MAS for ITLCS is the basis to alleviate traffic congestion. Existing approaches of MAS are largely based on Multi-Agent Deep Reinforcement Learning (MADRL). Although the Deep Neural Network (DNN) of MABRL is effective, the training time is long, and the parameters are difficult to trace. Recently, Broad Learning Systems (BLS) provided a selective way for learning in the deep neural networks by a flat network. Moreover, Broad Reinforcement Learning (BRL) extends BLS in Single Agent Deep Reinforcement Learning (SADRL) problem with promising results. However, BRL does not focus on the intricate structures and interaction of agents. Motivated by the feature of MADRL and the issue of BRL, we propose a Multi-Agent Broad Reinforcement Learning (MABRL) framework to explore the function of BLS in MAS. Firstly, unlike most existing MADRL approaches, which use a series of deep neural networks structures, we model each agent with broad networks. Then, we introduce a dynamic self-cycling interaction mechanism to confirm the "3W" information: When to interact, Which agents need to consider, What information to transmit. Finally, we do the experiments based on the intelligent traffic light control scenario. We compare the MABRL approach with six different approaches, and experimental results on three datasets verify the effectiveness of MABRL.
翻译:智能交通灯控制系统(IMAS)是一个典型的多机构系统,由多条道路和交通灯组成。为DICCS设计一个MAS模型是缓解交通拥堵的基础。现有的MAS方法主要基于多代理深层强化学习(MADRL ) 。虽然MABL的深神经网络(DNN)是有效的,但培训时间很长,参数也难以追踪。最近,宽广学习系统(BLS)为通过一个平坦网络在深层神经网络中学习提供了一种有选择的方法。此外,广加学习(BRL)在单一代理深层强化学习(SADRL)中推广BLS模型是缓解交通拥堵塞问题的基础。然而,BRL并不关注代理人的复杂结构和互动。根据MADRL的特征和BRL问题,我们建议一个多代理宽广强化学习(MABL)框架来探索MAS的BL功能。首先,与大多数现有的MADL方法不同,在对智能网络进行一系列的对比时使用一系列深度网络的BL(BL),我们用一个模拟的自我循环的自我互动模型来验证,我们每个代理的自我互动机制。