Existing approaches for autonomous control of pan-tilt-zoom (PTZ) cameras use multiple stages in which object detection and localization are performed separately from control of the PTZ mechanism. These approaches require manual labels and suffer from performance bottlenecks caused by error propagation across the multi-stage flow of information. The large size of object detection neural networks also makes prior solutions infeasible for real-time deployment on resource-constrained devices. We present Eagle, an end-to-end deep reinforcement learning (RL) solution that trains a neural network policy to take images directly as input and control the PTZ camera. Training RL policies in the real world is cumbersome due to labeling effort, runtime environment stochasticity, and fragile experimental setups, so we introduce a photo-realistic simulation framework for training and evaluating PTZ camera control policies. Eagle achieves superior camera control by keeping the object of interest close to the center of the captured images at high resolution, sustaining tracking for up to 17% longer than the state of the art. Eagle policies are lightweight (90x fewer parameters than Yolo5s) and run on embedded camera platforms such as the Raspberry Pi (33 FPS) and Jetson Nano (38 FPS), enabling real-time PTZ tracking in resource-constrained environments. With domain randomization, Eagle policies trained in our simulator transfer directly to real-world scenarios.
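To make the end-to-end formulation concrete, the following is a minimal sketch, not the paper's architecture, of a policy that maps a camera frame directly to a discrete PTZ command, with no separate detection or localization stage. All layer sizes, the action set, and the class name `TinyPTZPolicy` are illustrative assumptions.

```python
# Hypothetical sketch of an end-to-end PTZ control policy: a tiny network
# maps a downscaled grayscale frame directly to a pan/tilt/zoom action.
# Layer sizes and the action vocabulary are illustrative, not Eagle's.
import numpy as np

rng = np.random.default_rng(0)

class TinyPTZPolicy:
    """Maps an image observation to one discrete PTZ action."""
    ACTIONS = ["pan_left", "pan_right", "tilt_up", "tilt_down",
               "zoom_in", "zoom_out", "stay"]

    def __init__(self, obs_size=32, hidden=64):
        d = obs_size * obs_size
        # Two small weight matrices; in practice these would be trained
        # with RL in simulation, and kept far smaller than a detector.
        self.w1 = rng.standard_normal((d, hidden)) * 0.01
        self.w2 = rng.standard_normal((hidden, len(self.ACTIONS))) * 0.01

    def act(self, frame):
        x = frame.reshape(-1) / 255.0        # flatten and normalize pixels
        h = np.tanh(x @ self.w1)             # single hidden layer
        logits = h @ self.w2                 # one score per PTZ action
        return self.ACTIONS[int(np.argmax(logits))]

policy = TinyPTZPolicy()
frame = rng.integers(0, 256, size=(32, 32))  # stand-in for a camera frame
action = policy.act(frame)                   # e.g. "pan_left"
```

Because inference is a single forward pass over a compact network, a policy of this shape is the kind of model that can plausibly sustain real-time rates on embedded hardware, in contrast to running a full object detector per frame.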