Reinforcement learning (RL) enables an agent to learn by trial and error while interacting with a dynamic environment. Traditionally, RL has been used to learn and predict robotic manipulation skills that live in Euclidean space, such as positions, velocities, and forces. However, robotics commonly involves non-Euclidean data, such as orientation or stiffness, and neglecting their geometric nature can adversely affect learning performance and accuracy. In this paper, we propose a novel RL framework based on Riemannian geometry and show how it can be applied to learn manipulation skills with a specific geometric structure (e.g., the robot's orientation in the task space). The proposed framework is suitable for any policy representation and is independent of the choice of algorithm. Specifically, we propose to parameterize and learn the policy on the tangent space and then map the learned actions back to the appropriate manifold (e.g., the S^3 manifold of unit quaternions for orientation). This introduces geometrically grounded pre- and post-processing steps into the typical RL pipeline, which opens the door for algorithms designed for Euclidean spaces to learn from non-Euclidean data without modification. Experimental results, obtained both in simulation and on a real robot, support our hypothesis that learning on the tangent space is more accurate and converges to a better solution than approximating the non-Euclidean data in Euclidean space.
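To make the tangent-space mapping concrete, the following is a minimal sketch of the exponential and logarithmic maps for the S^3 manifold of unit quaternions, taken at the identity element. The quaternion layout (w, x, y, z) and the function names exp_map/log_map are illustrative assumptions, not names from the paper; a full implementation would also handle base points other than the identity.

```python
import numpy as np

def exp_map(u, eps=1e-8):
    """Map a tangent vector u in R^3 (at the identity) onto the unit-quaternion
    manifold S^3, returned as q = (w, x, y, z)."""
    norm = np.linalg.norm(u)
    if norm < eps:
        # Near-zero tangent vector maps to the identity quaternion.
        return np.array([1.0, 0.0, 0.0, 0.0])
    return np.concatenate(([np.cos(norm)], np.sin(norm) * u / norm))

def log_map(q, eps=1e-8):
    """Map a unit quaternion q = (w, x, y, z) back to a tangent vector in R^3
    at the identity (inverse of exp_map for rotations below pi)."""
    w, v = q[0], q[1:]
    norm_v = np.linalg.norm(v)
    if norm_v < eps:
        return np.zeros(3)
    return np.arccos(np.clip(w, -1.0, 1.0)) * v / norm_v

# Illustrative use: a Euclidean policy outputs a 3-D action in the tangent
# space, which is then projected onto S^3 to obtain a valid orientation.
tangent_action = np.array([0.1, -0.2, 0.05])   # hypothetical policy output
orientation_q = exp_map(tangent_action)        # unit quaternion on S^3
recovered = log_map(orientation_q)             # back to the tangent space
```

In this sketch the pre-processing step (log_map) brings manifold-valued observations into the flat tangent space where a standard Euclidean RL algorithm operates, and the post-processing step (exp_map) projects the learned actions back onto the manifold, so the surrounding RL algorithm itself needs no changes.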