学习探索感知任务的信息轨迹和样本收集 (Learning to Explore Informative Trajectories and Samples for Embodied Perception) - 专知论文

会员服务 ·

0

INFORMS · 样本 · Learning · Agent · MoDELS ·

2023 年 3 月 20 日

Learning to Explore Informative Trajectories and Samples for Embodied Perception

翻译：学习探索感知任务的信息轨迹和样本收集

Ya Jing,Tao Kong

from arxiv, To be published in IEEE International Conference on Robotics and Automation (ICRA), 2023

We are witnessing significant progress on perception models, specifically those trained on large-scale internet images. However, efficiently generalizing these perception models to unseen embodied tasks is insufficiently studied, which will help various relevant applications (e.g., home robots). Unlike static perception methods trained on pre-collected images, the embodied agent can move around in the environment and obtain images of objects from any viewpoints. Therefore, efficiently learning the exploration policy and collection method to gather informative training samples is the key to this task. To do this, we first build a 3D semantic distribution map to train the exploration policy self-supervised by introducing the semantic distribution disagreement and the semantic distribution uncertainty rewards. Note that the map is generated from multi-view observations and can weaken the impact of misidentification from an unfamiliar viewpoint. Our agent is then encouraged to explore the objects with different semantic distributions across viewpoints, or uncertain semantic distributions. With the explored informative trajectories, we propose to select hard samples on trajectories based on the semantic distribution uncertainty to reduce unnecessary observations that can be correctly identified. Experiments show that the perception model fine-tuned with our method outperforms the baselines trained with other exploration policies. Further, we demonstrate the robustness of our method in real-robot experiments.

翻译：我们正在见证感知模型方面的重大进展，特别是那些在大规模互联网图像上训练的模型。然而，将这些感知模型有效地推广到未见过的实体任务尚未得到充分研究，这将有助于各种相关应用（例如家庭机器人）。不同于在预先收集的图像上训练的静态感知方法，实体代理可以在环境中移动，并从任何视角获取物体的图像。因此，有效地学习探索策略和收集方法以收集信息丰富的训练样本是这项任务的关键。为此，我们首先建立了一个3D语义分布图，以通过引入语义分布不一致性和语义分布不确定性奖励自监督地训练探索策略。请注意，该地图是从多视角观察生成的，可以削弱来自不熟悉视角的误识别的影响。我们的代理随后被鼓励探索具有不同语义分布或不确定语义分布的对象。通过探索信息丰富的轨迹，我们建议根据语义分布的不确定性从轨迹选择难样本，以减少可以正确识别的不必要观察。实验表明，使用我们的方法微调的感知模型优于使用其他探索策略训练的基线。此外，我们在真实机器人实验中展示了我们方法的鲁棒性。

0

相关内容

INFORMS

《计算机信息》杂志发表高质量的论文，扩大了运筹学和计算的范围，寻求有关理论、方法、实验、系统和应用方面的原创研究论文、新颖的调查和教程论文，以及描述新的和有用的软件工具的论文。官网链接：https://pubsonline.informs.org/journal/ijoc

148页最新《深度强化学习》教程，148页ppt

148页最新《深度强化学习》教程，148页ppt

专知会员服务

77+阅读 · 2023年4月29日

JCIM丨DRlinker：深度强化学习优化片段连接设计

JCIM丨DRlinker：深度强化学习优化片段连接设计

专知会员服务

7+阅读 · 2022年12月9日

《行为与认知机器人学》，241页pdf

《行为与认知机器人学》，241页pdf

专知会员服务

55+阅读 · 2021年4月11日

零样本文本分类，Zero-Shot Learning for Text Classification

零样本文本分类，Zero-Shot Learning for Text Classification

专知会员服务

97+阅读 · 2020年5月31日

【ICML2020投稿论文-CMU-DeepMind-Google】用于评估跨语言泛化的大规模多语言多任务基准

【ICML2020投稿论文-CMU-DeepMind-Google】用于评估跨语言泛化的大规模多语言多任务基准

专知会员服务

14+阅读 · 2020年3月27日

自动驾驶汽车的协调:分类和调查综述（Coordination of Autonomous Vehicles: Taxonomy and Survey），附31页pdf

自动驾驶汽车的协调:分类和调查综述（Coordination of Autonomous Vehicles: Taxonomy and Survey），附31页pdf

专知会员服务

14+阅读 · 2020年1月9日

【CVPR 2019 | tutorial】自主汽车的感知、预测和大规模数据采集：Perception, Prediction, and Large Scale Data Collection for Autonomous Cars

【CVPR 2019 | tutorial】自主汽车的感知、预测和大规模数据采集：Perception, Prediction, and Large Scale Data Collection for Autonomous Cars

专知会员服务

33+阅读 · 2019年11月28日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

Multi-Task Learning的几篇综述文章

Multi-Task Learning的几篇综述文章

深度学习自然语言处理

15+阅读 · 2020年6月15日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

小样本学习（Few-shot Learning）综述

小样本学习（Few-shot Learning）综述

机器之心

18+阅读 · 2019年4月1日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

基于自主学习的Ad hoc Agent序贯决策研究

国家自然科学基金

46+阅读 · 2015年12月31日

中国田鼠亚科 Microtini族(Rodentia: Cricetidae: Arvicolinae)的分类与系统发育研究

国家自然科学基金

0+阅读 · 2014年12月31日

面向海洋数据收集和移动目标搜索的多AUV航迹规划

国家自然科学基金

5+阅读 · 2014年12月31日

中国产石竹科无心菜属（Arenaria）的分类学研究

国家自然科学基金

0+阅读 · 2014年12月31日

面向未知环境探索的类别独立对象检测

国家自然科学基金

0+阅读 · 2013年12月31日

移动机器人基于三维激光测距的室内场景认知与物体识别

国家自然科学基金

0+阅读 · 2012年12月31日

空间群组目标相似度计算模型研究

国家自然科学基金

0+阅读 · 2012年12月31日

医学图像分割中面向目标的形状统计与边界特征学习

国家自然科学基金

3+阅读 · 2011年12月31日

自主车辆的高质量三维场景认知与导航避障控制方法研究

国家自然科学基金

1+阅读 · 2011年12月31日

基于视觉感知的中国书画图像语义自动分类研究

国家自然科学基金

0+阅读 · 2009年12月31日

Knowledge-enhanced Agents for Interactive Text Games

Arxiv

0+阅读 · 2023年5月8日

Sense, Imagine, Act: Multimodal Perception Improves Model-Based Reinforcement Learning for Head-to-Head Autonomous Racing

Arxiv

0+阅读 · 2023年5月8日

Data Efficient Training with Imbalanced Label Sample Distribution for Fashion Detection

Arxiv

0+阅读 · 2023年5月7日

Truncating Trajectories in Monte Carlo Reinforcement Learning

Arxiv

0+阅读 · 2023年5月7日

Learning Hybrid Actor-Critic Maps for 6D Non-Prehensile Manipulation

Arxiv

0+阅读 · 2023年5月6日

Composite Motion Learning with Task Control

Arxiv

0+阅读 · 2023年5月5日

A Decade of Knowledge Graphs in Natural Language Processing: A Survey

Arxiv

28+阅读 · 2022年9月30日

Multi-Task Learning for Visual Scene Understanding

Arxiv

29+阅读 · 2022年3月28日

Multimodality in Meta-Learning: A Comprehensive Survey

Arxiv

37+阅读 · 2021年9月28日

Building Intelligent Autonomous Navigation Agents

Arxiv

25+阅读 · 2021年6月25日

VIP会员

文章信息

相关主题

相关VIP内容

148页最新《深度强化学习》教程，148页ppt

148页最新《深度强化学习》教程，148页ppt

专知会员服务

77+阅读 · 2023年4月29日

JCIM丨DRlinker：深度强化学习优化片段连接设计

JCIM丨DRlinker：深度强化学习优化片段连接设计

专知会员服务

7+阅读 · 2022年12月9日

《行为与认知机器人学》，241页pdf

《行为与认知机器人学》，241页pdf

专知会员服务

55+阅读 · 2021年4月11日

零样本文本分类，Zero-Shot Learning for Text Classification

零样本文本分类，Zero-Shot Learning for Text Classification

专知会员服务

97+阅读 · 2020年5月31日

【ICML2020投稿论文-CMU-DeepMind-Google】用于评估跨语言泛化的大规模多语言多任务基准

【ICML2020投稿论文-CMU-DeepMind-Google】用于评估跨语言泛化的大规模多语言多任务基准

专知会员服务

14+阅读 · 2020年3月27日

自动驾驶汽车的协调:分类和调查综述（Coordination of Autonomous Vehicles: Taxonomy and Survey），附31页pdf

自动驾驶汽车的协调:分类和调查综述（Coordination of Autonomous Vehicles: Taxonomy and Survey），附31页pdf

专知会员服务

14+阅读 · 2020年1月9日

【CVPR 2019 | tutorial】自主汽车的感知、预测和大规模数据采集：Perception, Prediction, and Large Scale Data Collection for Autonomous Cars

【CVPR 2019 | tutorial】自主汽车的感知、预测和大规模数据采集：Perception, Prediction, and Large Scale Data Collection for Autonomous Cars

专知会员服务

33+阅读 · 2019年11月28日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《俄乌战争中的无人系统：新的战争方式与新兴趋势——来自前线的印象》报告

《海上自主水面船舶远程操作中心：安全可持续运行的多维度分析》

多模态大语言模型下游调优中“保持自我”的重要性

隐身自主无人水下航行器技术如何变革水下作战并重塑海军竞争

相关资讯

Multi-Task Learning的几篇综述文章

Multi-Task Learning的几篇综述文章

深度学习自然语言处理

15+阅读 · 2020年6月15日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

小样本学习（Few-shot Learning）综述

小样本学习（Few-shot Learning）综述

机器之心

18+阅读 · 2019年4月1日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

相关论文

Knowledge-enhanced Agents for Interactive Text Games

Arxiv

0+阅读 · 2023年5月8日

Sense, Imagine, Act: Multimodal Perception Improves Model-Based Reinforcement Learning for Head-to-Head Autonomous Racing

Arxiv

0+阅读 · 2023年5月8日

Data Efficient Training with Imbalanced Label Sample Distribution for Fashion Detection

Arxiv

0+阅读 · 2023年5月7日

Truncating Trajectories in Monte Carlo Reinforcement Learning

Arxiv

0+阅读 · 2023年5月7日

Learning Hybrid Actor-Critic Maps for 6D Non-Prehensile Manipulation

Arxiv

0+阅读 · 2023年5月6日

Composite Motion Learning with Task Control

Arxiv

0+阅读 · 2023年5月5日

A Decade of Knowledge Graphs in Natural Language Processing: A Survey

Arxiv

28+阅读 · 2022年9月30日

Multi-Task Learning for Visual Scene Understanding

Arxiv

29+阅读 · 2022年3月28日

Multimodality in Meta-Learning: A Comprehensive Survey

Arxiv

37+阅读 · 2021年9月28日

Building Intelligent Autonomous Navigation Agents

Arxiv

25+阅读 · 2021年6月25日

相关基金

基于自主学习的Ad hoc Agent序贯决策研究

国家自然科学基金

46+阅读 · 2015年12月31日

中国田鼠亚科 Microtini族(Rodentia: Cricetidae: Arvicolinae)的分类与系统发育研究

国家自然科学基金

0+阅读 · 2014年12月31日

面向海洋数据收集和移动目标搜索的多AUV航迹规划

国家自然科学基金

5+阅读 · 2014年12月31日

中国产石竹科无心菜属（Arenaria）的分类学研究

国家自然科学基金

0+阅读 · 2014年12月31日

面向未知环境探索的类别独立对象检测

国家自然科学基金

0+阅读 · 2013年12月31日

移动机器人基于三维激光测距的室内场景认知与物体识别

国家自然科学基金

0+阅读 · 2012年12月31日

空间群组目标相似度计算模型研究

国家自然科学基金

0+阅读 · 2012年12月31日

医学图像分割中面向目标的形状统计与边界特征学习

国家自然科学基金

3+阅读 · 2011年12月31日

自主车辆的高质量三维场景认知与导航避障控制方法研究

国家自然科学基金

1+阅读 · 2011年12月31日

基于视觉感知的中国书画图像语义自动分类研究

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员