RSPT: Reconstruct Surroundings and Predict Trajectories for Generalizable Active Object Tracking

Generalization · Reconstruction · Motion system · Active tracking · State representation

April 7, 2023

Fangwei Zhong, Xiao Bi, Yudi Zhang, Wei Zhang, Yizhou Wang

From arXiv; AAAI 2023 (Oral)

Active Object Tracking (AOT) aims to maintain a specific relation between the tracker and object(s) by autonomously controlling the motion system of a tracker given observations. AOT has wide-ranging applications, such as in mobile robots and autonomous driving. However, building a generalizable active tracker that works robustly across different scenarios remains a challenge, especially in unstructured environments with cluttered obstacles and diverse layouts. We argue that constructing a state representation capable of modeling the geometry structure of the surroundings and the dynamics of the target is crucial for achieving this goal. To address this challenge, we present RSPT, a framework that forms a structure-aware motion representation by Reconstructing the Surroundings and Predicting the target Trajectory. Additionally, we enhance the generalization of the policy network by training in an asymmetric dueling mechanism. We evaluate RSPT on various simulated scenarios and show that it outperforms existing methods in unseen environments, particularly those with complex obstacles and layouts. We also demonstrate the successful transfer of RSPT to real-world settings. Project Website: https://sites.google.com/view/aot-rspt.

