局部控制环境中反转弧弧度的等级性贝叶斯模型 (A Hierarchical Bayesian model for Inverse RL in Partially-Controlled Environments) - 专知论文

会员服务 ·

0

回合 · MoDELS · 可辨认的 · 逆强化学习 · 多样性 ·

2021 年 7 月 13 日

A Hierarchical Bayesian model for Inverse RL in Partially-Controlled Environments

翻译：局部控制环境中反转弧弧度的等级性贝叶斯模型

Kenneth Bogert,Prashant Doshi

from arxiv, 8 pages, 10 figures

Robots learning from observations in the real world using inverse reinforcement learning (IRL) may encounter objects or agents in the environment, other than the expert, that cause nuisance observations during the demonstration. These confounding elements are typically removed in fully-controlled environments such as virtual simulations or lab settings. When complete removal is impossible the nuisance observations must be filtered out. However, identifying the source of observations when large amounts of observations are made is difficult. To address this, we present a hierarchical Bayesian model that incorporates both the expert's and the confounding elements' observations thereby explicitly modeling the diverse observations a robot may receive. We extend an existing IRL algorithm originally designed to work under partial occlusion of the expert to consider the diverse observations. In a simulated robotic sorting domain containing both occlusion and confounding elements, we demonstrate the model's effectiveness. In particular, our technique outperforms several other comparative methods, second only to having perfect knowledge of the subject's trajectory.

翻译：机器人在现实世界中用反向强化学习(IRL)从观测中学习时,除了专家外,在环境中可能会遇到在演示期间引起干扰观测的物体或物剂。这些混杂元素通常在完全控制的环境中被清除,例如虚拟模拟或实验室设置。当不可能完全清除时,必须过滤扰动观测结果。然而,在进行大量观测时确定观测来源是困难的。要解决这个问题,我们提出一种高等级的巴伊西亚模型,既包括专家的观测,也包括混杂元素的观测结果,从而明确模拟机器人可能得到的不同观测结果。我们扩展了一种现有的IRL算法,最初设计在专家部分隔离下工作,以考虑各种观测结果。在模拟机器人分类的域中,既包含封闭因素,又包含聚合要素,我们展示模型的有效性。特别是,我们的技术超越了其他几种比较方法,其次于对主体的轨迹的完全了解。

0

相关内容

【经典书】应用离散结构，568页pdf

专知会员服务

84+阅读 · 2021年5月4日

【干货书】实体搜索，Entity-Oriented Search，358页pdf

【干货书】实体搜索，Entity-Oriented Search，358页pdf

专知会员服务

35+阅读 · 2021年4月9日

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

【干货书】管理统计和数据科学原理，678页pdf

【干货书】管理统计和数据科学原理，678页pdf

专知会员服务

186+阅读 · 2020年7月29日

【经典书】数据挖掘：理论、算法与示例，347页pdf，Nong Ye，Arizona State University

【经典书】数据挖掘：理论、算法与示例，347页pdf，Nong Ye，Arizona State University

专知会员服务

82+阅读 · 2020年2月27日

【AAAI2020教程】强化学习中的Exploration-Exploitation in Reinforcement Learning

专知会员服务

101+阅读 · 2020年2月8日

深度强化学习策略梯度教程，53页ppt

深度强化学习策略梯度教程，53页ppt

专知会员服务

184+阅读 · 2020年2月1日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

RL 真经

CreateAMind

5+阅读 · 2018年12月28日

Hierarchical Imitation - Reinforcement Learning

Hierarchical Imitation - Reinforcement Learning

CreateAMind

19+阅读 · 2018年5月25日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

强化学习 cartpole_a3c

强化学习 cartpole_a3c

CreateAMind

9+阅读 · 2017年7月21日

Vision Transformer for Learning Driving Policies in Complex Multi-Agent Environments

Vision Transformer for Learning Driving Policies in Complex Multi-Agent Environments

Arxiv

0+阅读 · 2021年9月14日

Learning and Leveraging Environmental Features to Improve Robot Awareness

Arxiv

0+阅读 · 2021年9月13日

Meta Navigator: Search for a Good Adaptation Policy for Few-shot Learning

Arxiv

0+阅读 · 2021年9月13日

Federated Ensemble Model-based Reinforcement Learning

Arxiv

0+阅读 · 2021年9月12日

Robot Navigation in Irregular Environments with Local Elevation Estimation using Deep Reinforcement Learning

Arxiv

0+阅读 · 2021年9月10日

Projected State-action Balancing Weights for Offline Reinforcement Learning

Arxiv

0+阅读 · 2021年9月10日

Inverse Constrained Reinforcement Learning

Arxiv

8+阅读 · 2021年5月21日

Language as an Abstraction for Hierarchical Deep Reinforcement Learning

Language as an Abstraction for Hierarchical Deep Reinforcement Learning

Arxiv

5+阅读 · 2019年6月18日

IQA: Visual Question Answering in Interactive Environments

Arxiv

5+阅读 · 2018年4月5日

Video Captioning via Hierarchical Reinforcement Learning

Arxiv

20+阅读 · 2018年3月29日

VIP会员

文章信息

相关主题

逆强化学习

相关VIP内容

【经典书】应用离散结构，568页pdf

专知会员服务

84+阅读 · 2021年5月4日

【干货书】实体搜索，Entity-Oriented Search，358页pdf

【干货书】实体搜索，Entity-Oriented Search，358页pdf

专知会员服务

35+阅读 · 2021年4月9日

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

【干货书】管理统计和数据科学原理，678页pdf

【干货书】管理统计和数据科学原理，678页pdf

专知会员服务

186+阅读 · 2020年7月29日

【经典书】数据挖掘：理论、算法与示例，347页pdf，Nong Ye，Arizona State University

【经典书】数据挖掘：理论、算法与示例，347页pdf，Nong Ye，Arizona State University

专知会员服务

82+阅读 · 2020年2月27日

【AAAI2020教程】强化学习中的Exploration-Exploitation in Reinforcement Learning

专知会员服务

101+阅读 · 2020年2月8日

深度强化学习策略梯度教程，53页ppt

深度强化学习策略梯度教程，53页ppt

专知会员服务

184+阅读 · 2020年2月1日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

《人与智能体在系统工程建模语言V2任务中的性能表现：基于用户中心化的评估方法》308页

《数据安全国家标准体系（2025版）》征求意见稿

AlphaMosaic：人工智能赋能的作战管理系统

《军事行动中通信平台的战略价值：提升战术效能与作战优势》

相关资讯

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

RL 真经

CreateAMind

5+阅读 · 2018年12月28日

Hierarchical Imitation - Reinforcement Learning

Hierarchical Imitation - Reinforcement Learning

CreateAMind

19+阅读 · 2018年5月25日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

强化学习 cartpole_a3c

强化学习 cartpole_a3c

CreateAMind

9+阅读 · 2017年7月21日

相关论文

Vision Transformer for Learning Driving Policies in Complex Multi-Agent Environments

Vision Transformer for Learning Driving Policies in Complex Multi-Agent Environments

Arxiv

0+阅读 · 2021年9月14日

Learning and Leveraging Environmental Features to Improve Robot Awareness

Arxiv

0+阅读 · 2021年9月13日

Meta Navigator: Search for a Good Adaptation Policy for Few-shot Learning

Arxiv

0+阅读 · 2021年9月13日

Federated Ensemble Model-based Reinforcement Learning

Arxiv

0+阅读 · 2021年9月12日

Robot Navigation in Irregular Environments with Local Elevation Estimation using Deep Reinforcement Learning

Arxiv

0+阅读 · 2021年9月10日

Projected State-action Balancing Weights for Offline Reinforcement Learning

Arxiv

0+阅读 · 2021年9月10日

Inverse Constrained Reinforcement Learning

Arxiv

8+阅读 · 2021年5月21日

Language as an Abstraction for Hierarchical Deep Reinforcement Learning

Language as an Abstraction for Hierarchical Deep Reinforcement Learning

Arxiv

5+阅读 · 2019年6月18日

IQA: Visual Question Answering in Interactive Environments

Arxiv

5+阅读 · 2018年4月5日

Video Captioning via Hierarchical Reinforcement Learning

Arxiv

20+阅读 · 2018年3月29日

微信扫码咨询专知VIP会员