We present a novel reinforcement learning (RL)-based task allocation and decentralized navigation algorithm for mobile robots in warehouse environments. Our approach is designed for scenarios in which multiple robots perform various pickup and delivery tasks. We consider the problem of joint decentralized task allocation and navigation and present a two-level approach to solve it. At the higher level, we solve task allocation by formulating it as a Markov Decision Process and choosing appropriate rewards to minimize the Total Travel Delay (TTD). At the lower level, we use a decentralized navigation scheme based on ORCA that enables each robot to perform these tasks independently and to avoid collisions with other robots and dynamic obstacles. We combine the two levels by defining the rewards of the higher level as the feedback from the lower-level navigation algorithm. We perform an extensive evaluation in complex warehouse layouts with large numbers of agents and highlight the benefits over state-of-the-art algorithms based on myopic pickup-distance minimization and regret-based task selection. We observe improvements of up to 14% in task completion time and up to 40% in computing collision-free trajectories for the robots.
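To make the two-level structure described above concrete, the following Python sketch shows one possible way a high-level allocator could treat the travel delay reported by the navigation layer as its reward signal. It is a minimal illustration assuming a tabular Q-learning allocator and a stubbed ORCA call; the names `TaskAllocator`, `navigate_orca`, and `episode_step` are hypothetical and not drawn from the paper.

```python
# Hypothetical sketch of the two-level loop: a high-level task allocator
# (MDP / Q-learning) whose reward is the feedback (travel delay) returned
# by a lower-level, decentralized ORCA navigation routine.
import random
from collections import defaultdict


class TaskAllocator:
    """High-level MDP: state = (robot id, pending task set), action = chosen task."""

    def __init__(self, epsilon=0.1, alpha=0.5, gamma=0.95):
        self.q = defaultdict(float)  # Q[(state, task)] -> value estimate
        self.epsilon, self.alpha, self.gamma = epsilon, alpha, gamma

    def select_task(self, state, pending_tasks):
        # Epsilon-greedy selection over the pending pickup-and-delivery tasks.
        if random.random() < self.epsilon:
            return random.choice(pending_tasks)
        return max(pending_tasks, key=lambda t: self.q[(state, t)])

    def update(self, state, task, reward, next_state, next_tasks):
        # One-step Q-learning backup using the navigation feedback as reward.
        best_next = max((self.q[(next_state, t)] for t in next_tasks), default=0.0)
        target = reward + self.gamma * best_next
        self.q[(state, task)] += self.alpha * (target - self.q[(state, task)])


def navigate_orca(robot, task):
    """Placeholder for the decentralized ORCA navigation layer.

    A real implementation would run local collision avoidance (e.g. via an
    RVO/ORCA library) and return the realized travel delay for the task.
    """
    raise NotImplementedError


def episode_step(allocator, robot, state, pending_tasks):
    task = allocator.select_task(state, pending_tasks)
    delay = navigate_orca(robot, task)  # feedback from the lower level
    reward = -delay                     # reward shaped to minimize Total Travel Delay
    remaining = [t for t in pending_tasks if t != task]
    next_state = (robot, frozenset(remaining))
    allocator.update(state, task, reward, next_state, remaining)
    return next_state, remaining
```

The design choice illustrated here is only the coupling between the levels: the allocator never models collisions explicitly; it simply learns from the delays that the decentralized navigation layer actually incurs.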