Recently, as the demand for cleaning robots has steadily increased, household electricity consumption has also risen. To address this, efficient path planning for cleaning robots has become an important problem and has been studied extensively. However, most existing work concerns moving along a simple path segment rather than planning a complete path that covers every location to be cleaned. Reinforcement learning (RL), an emerging deep learning technique, has been applied to cleaning robots, but the resulting models operate only in a specific cleaning environment rather than across varied environments, so they must be retrained whenever the environment changes. To solve this problem, the proximal policy optimization (PPO) algorithm is combined with an efficient path-planning method that operates in various cleaning environments, using transfer learning (TL), detection of the nearest cleaned tile, reward shaping, and an elite set. The proposed method is validated through an ablation study and a comparison with conventional methods such as random and zigzag path planning. The experimental results demonstrate that the proposed method achieves better training performance and faster convergence than the original PPO, and that it outperforms the conventional (random and zigzag) methods.
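The abstract names reward shaping and nearest-tile detection without detail. As a rough illustration only, and not the authors' implementation, the sketch below shows one common way to shape a coverage reward in a toy grid-cleaning environment: a bonus for cleaning a new tile, a small step penalty, and a potential-based term derived from the Manhattan distance to the nearest not-yet-cleaned tile. The class name, reward weights, and the choice of potential are all assumptions made for this example.

```python
import numpy as np

# Illustrative sketch only: a toy grid "cleaning" environment with a shaped
# reward, loosely inspired by the reward-shaping idea named in the abstract.
# GridCleanEnv, the shaping weights, and the nearest-tile potential are
# assumptions for illustration, not the paper's actual method.

class GridCleanEnv:
    ACTIONS = [(-1, 0), (1, 0), (0, -1), (0, 1)]  # up, down, left, right

    def __init__(self, size=5, step_penalty=-0.01, clean_reward=1.0, shaping_weight=0.1):
        self.size = size
        self.step_penalty = step_penalty
        self.clean_reward = clean_reward
        self.shaping_weight = shaping_weight
        self.reset()

    def reset(self):
        self.pos = (0, 0)
        self.cleaned = np.zeros((self.size, self.size), dtype=bool)
        self.cleaned[self.pos] = True
        return self._obs()

    def _obs(self):
        # Observation: cleaned-tile map plus a one-hot agent position map.
        agent = np.zeros((self.size, self.size), dtype=np.float32)
        agent[self.pos] = 1.0
        return np.stack([self.cleaned.astype(np.float32), agent])

    def _potential(self):
        # Negative Manhattan distance to the nearest not-yet-cleaned tile
        # (assumed potential; the paper's nearest-tile criterion may differ).
        dirty = np.argwhere(~self.cleaned)
        if len(dirty) == 0:
            return 0.0
        dists = np.abs(dirty - np.array(self.pos)).sum(axis=1)
        return -float(dists.min())

    def step(self, action):
        phi_old = self._potential()
        dr, dc = self.ACTIONS[action]
        r = min(max(self.pos[0] + dr, 0), self.size - 1)
        c = min(max(self.pos[1] + dc, 0), self.size - 1)
        self.pos = (r, c)

        reward = self.step_penalty
        if not self.cleaned[self.pos]:
            self.cleaned[self.pos] = True
            reward += self.clean_reward

        # Potential-based shaping term (discount factor 1 for simplicity)
        # nudges the agent toward the remaining dirty tiles.
        reward += self.shaping_weight * (self._potential() - phi_old)
        done = bool(self.cleaned.all())
        return self._obs(), reward, done


if __name__ == "__main__":
    env = GridCleanEnv()
    obs = env.reset()
    done, total = False, 0.0
    while not done:
        obs, r, done = env.step(np.random.randint(4))  # random baseline policy
        total += r
    print("episode return under the random baseline:", round(total, 2))
```

Running the script rolls out the random baseline mentioned in the abstract's comparison; in the proposed approach, the action would instead come from a PPO policy trained on this shaped reward.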