We investigate the feasibility of deploying deep reinforcement learning agents based on Deep Q-learning for job-shop scheduling problems in the context of modular production facilities, using discrete-event simulations as the environment. These environments consist of a source and a sink for the parts to be processed, as well as one or more workstations. The agents are trained to schedule automated guided vehicles that transport the parts between these stations in an optimal fashion. Starting from a very simple setup, we gradually increase the complexity of the environment and compare the agents' performance with well-established heuristic approaches, such as first-in-first-out dispatching, cost tables, and a nearest-neighbor approach. We furthermore search for particular configurations of the environments in which the heuristic approaches struggle, in order to investigate to what degree the Deep-Q agents are affected by the same challenges. We find that the Deep-Q agents achieve performance comparable to the heuristic baselines. Moreover, our findings suggest that the DRL agents exhibit increased robustness to noise compared to the conventional approaches. Overall, we find that DRL agents constitute a valuable approach to this type of scheduling problem.
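To make one of the heuristic baselines concrete, the following minimal Python sketch illustrates a nearest-neighbor dispatching rule of the kind the abstract refers to: the vehicle serves whichever open transport request originates closest to its current position. The class and function names, the coordinate representation, and the toy data are assumptions for illustration only; the paper's environments (source, sink, and several workstations in a discrete-event simulation) are richer than this.

```python
import math
from dataclasses import dataclass

# Hypothetical illustration of a nearest-neighbor dispatching baseline for a
# single automated guided vehicle (AGV). All names and data here are invented
# for the sketch, not taken from the paper's implementation.

@dataclass
class TransportRequest:
    part_id: int
    origin: tuple[float, float]       # station currently holding the part
    destination: tuple[float, float]  # next workstation or the sink

def nearest_neighbor_dispatch(agv_pos, open_requests):
    """Pick the open request whose origin station is closest to the AGV."""
    return min(open_requests, key=lambda r: math.dist(agv_pos, r.origin))

# Toy usage: two pending requests, one AGV.
requests = [
    TransportRequest(0, origin=(0.0, 5.0), destination=(10.0, 5.0)),
    TransportRequest(1, origin=(2.0, 1.0), destination=(10.0, 5.0)),
]
agv_position = (1.0, 0.0)
job = nearest_neighbor_dispatch(agv_position, requests)
print(f"serving part {job.part_id} first")  # part 1: its origin is closer
```

A Deep-Q agent would replace the fixed `min`-over-distances rule with a learned action-value function over the same kind of dispatching decision, which is what makes the two approaches directly comparable in the experiments.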