Data efficient reinforcement learning and adaptive optimal perimeter control of network traffic dynamics - 专知论文


Learning · Controller · Optimizer · Lyapunov · Reinforcement learning

September 13, 2022

Data efficient reinforcement learning and adaptive optimal perimeter control of network traffic dynamics


C. Chen, Y. P. Huang, W. H. K. Lam, T. L. Pan, S. C. Hsu, A. Sumalee, R. X. Zhong

Existing data-driven and feedback traffic control strategies do not consider the heterogeneity of real-time data measurements. In addition, traditional reinforcement learning (RL) methods for traffic control usually converge slowly because they use data inefficiently. Moreover, conventional optimal perimeter control schemes require exact knowledge of the system dynamics and are therefore fragile to endogenous uncertainties. To handle these challenges, this work proposes an integral reinforcement learning (IRL) based approach to learning the macroscopic traffic dynamics for adaptive optimal perimeter control. This work makes the following primary contributions to the transportation literature: (a) A continuous-time control is developed with discrete gain updates to adapt to the discrete-time sensor data. (b) To reduce the sample complexity and use the available data more efficiently, the experience replay (ER) technique is introduced into the IRL algorithm. (c) The proposed method relaxes the requirement on model calibration in a "model-free" manner, which provides robustness against modeling uncertainty and enhances real-time performance via a data-driven RL algorithm. (d) The convergence of the IRL-based algorithms and the stability of the controlled traffic dynamics are proven via Lyapunov theory. The optimal control law is parameterized and then approximated by neural networks (NN), which moderates the computational complexity. Both state and input constraints are considered, and no model linearization is required. Numerical examples and simulation experiments are presented to verify the effectiveness and efficiency of the proposed method.
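To make the idea of IRL with stored samples more concrete, the following is a minimal sketch on a toy problem, not the method of the paper. It assumes a scalar linear plant dx/dt = a·x + b·u with quadratic running cost q·x² + r·u², a known input gain b, and a quadratic value function V(x) = p·x². None of these appear in the abstract, and the names `rollout` and `irl_policy_iteration` are hypothetical. The learner never reads the drift parameter `a`; it only sees sampled states and integrated costs over intervals of length T, which it keeps in a small buffer and reuses in a batch least-squares fit as a simplified stand-in for experience replay. Neural-network approximation and the state and input constraints mentioned above are omitted.

```python
import math


def rollout(x0, k, a, b, q, r, T, n_sub=1000):
    """Simulate one sampling interval of length T under u = -k * x.

    Returns the end state and the integrated running cost. The plant
    parameter `a` is used only inside this simulator, never by the learner.
    """
    dt = T / n_sub
    x, cost = x0, 0.0
    for _ in range(n_sub):
        u = -k * x
        x_next = x + dt * (a * x + b * u)  # explicit Euler step
        u_next = -k * x_next
        # Trapezoidal rule for the running cost q*x^2 + r*u^2.
        cost += 0.5 * dt * (q * x * x + r * u * u
                            + q * x_next * x_next + r * u_next * u_next)
        x = x_next
    return x, cost


def irl_policy_iteration(k0, b, r, sample, n_iters=8, n_intervals=10, x0=1.0):
    """Integral RL policy iteration for a scalar plant (illustrative only).

    `sample(x, k)` must return (x_end, cost) for one interval under gain k.
    `k0` is assumed to be a stabilizing initial gain.
    """
    k, p = k0, None
    for _ in range(n_iters):
        buffer = []  # stored (x_start, x_end, cost) tuples for the current gain
        x = x0
        for _ in range(n_intervals):
            x_end, cost = sample(x, k)
            buffer.append((x, x_end, cost))
            x = x_end
        # Policy evaluation: fit p in p * (x_start^2 - x_end^2) = cost
        # by least squares over every stored interval.
        num = sum((xs * xs - xe * xe) * c for xs, xe, c in buffer)
        den = sum((xs * xs - xe * xe) ** 2 for xs, xe, _ in buffer)
        p = num / den
        # Policy improvement: needs the input gain b but not the drift a.
        k = b * p / r
    return p, k


if __name__ == "__main__":
    a, b, q, r = 0.5, 1.0, 1.0, 1.0  # assumed toy parameters
    p, k = irl_policy_iteration(
        2.0, b, r, lambda x, kk: rollout(x, kk, a, b, q, r, T=0.1))
    # Closed-form solution of the scalar Riccati equation, for comparison.
    p_star = r * (a + math.sqrt(a * a + b * b * q / r)) / (b * b)
    print(p, p_star, k)
```

In this toy setting the learned `p` can be compared against the closed-form solution of the scalar Riccati equation, which gives a quick sanity check that the data-driven iteration recovers the optimal gain without access to `a`.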

