流感实时实时物体探测 (Real-time Object Detection for Streaming Perception) - 专知论文

会员服务 ·

0

流 · DFP · 模型评估 · SimPLe · 回合 ·

2022 年 3 月 23 日

Real-time Object Detection for Streaming Perception

翻译：流感实时实时物体探测

Jinrong Yang,Songtao Liu,Zeming Li,Xiaoping Li,Jian Sun

from arxiv, CVPR 2022 Accepted Paper

Autonomous driving requires the model to perceive the environment and (re)act within a low latency for safety. While past works ignore the inevitable changes in the environment after processing, streaming perception is proposed to jointly evaluate the latency and accuracy into a single metric for video online perception. In this paper, instead of searching trade-offs between accuracy and speed like previous works, we point out that endowing real-time models with the ability to predict the future is the key to dealing with this problem. We build a simple and effective framework for streaming perception. It equips a novel DualFlow Perception module (DFP), which includes dynamic and static flows to capture the moving trend and basic detection feature for streaming prediction. Further, we introduce a Trend-Aware Loss (TAL) combined with a trend factor to generate adaptive weights for objects with different moving speeds. Our simple method achieves competitive performance on Argoverse-HD dataset and improves the AP by 4.9% compared to the strong baseline, validating its effectiveness. Our code will be made available at https://github.com/yancie-yjr/StreamYOLO.

翻译：自动驾驶要求模型在低纬度的低纬度内感知环境和(再)反应安全。虽然过去的工作忽视了处理后不可避免的环境变化,但建议对流感进行联合评价,将潜值和准确度纳入一个单一的在线视频感知指标。在本文中,我们不象以前的工作那样在精确度和速度之间寻找取舍,而是指出,具有预测未来能力的实时模型是解决这一问题的关键。我们为流化感构建了一个简单有效的框架。它装备了一个全新的双光谱感知模块(DFP),其中包括动态和静态流,以捕捉流动趋势,以及流化预测的基本检测特征。此外,我们引入了Trend-Aware Loss(TAL),加上一个趋势系数,以生成移动速度不同的物体的适应权重。我们的简单方法在Argoververs-HD数据集上取得了竞争性的性表现,并将AP比强基线提高4.9%,并验证其有效性。我们的代码将在 https://github.com/yancie-yor/Stream-YOL/Stream 中提供。

0

相关内容

【CVPR2022】自动驾驶中的伪双目三维目标检测，Pseudo-Stereo for Monocular 3D Object Detection in Autonomous Driving

【CVPR2022】自动驾驶中的伪双目三维目标检测，Pseudo-Stereo for Monocular 3D Object Detection in Autonomous Driving

专知会员服务

18+阅读 · 2022年3月19日

【CVPR 2022】实时实例分割的稀疏实例激活，Sparse Instance Activation for Real-Time Instance Segmentation

【CVPR 2022】实时实例分割的稀疏实例激活，Sparse Instance Activation for Real-Time Instance Segmentation

专知会员服务

8+阅读 · 2022年3月12日

50+篇《神经架构搜索NAS》2020论文合集

专知会员服务

61+阅读 · 2020年3月19日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

165+阅读 · 2020年3月18日

CVPR 2020 论文开源项目合集

专知会员服务

110+阅读 · 2020年3月12日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

运动物体检测与运动相机:一个全面的综述：Moving Objects Detection with a Moving Camera: A Comprehensive Review

运动物体检测与运动相机:一个全面的综述：Moving Objects Detection with a Moving Camera: A Comprehensive Review

专知会员服务

27+阅读 · 2020年1月17日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Plenary Talk1

【ICIG2021】Latest News & Announcements of the Plenary Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年11月1日

【ICIG2021】Latest News & Announcements of the Industry Talk2

【ICIG2021】Latest News & Announcements of the Industry Talk2

中国图象图形学学会CSIG

0+阅读 · 2021年7月29日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【论文推荐】最新九篇自动问答相关论文—可解释推理网络、上下文知识图谱嵌入、注意力RNN、Multi-Cast注意力网络

【论文推荐】最新九篇自动问答相关论文—可解释推理网络、上下文知识图谱嵌入、注意力RNN、Multi-Cast注意力网络

专知

15+阅读 · 2018年6月29日

ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

全球人工智能

19+阅读 · 2017年12月17日

【推荐】YOLO实时目标检测(6fps)

【推荐】YOLO实时目标检测(6fps)

机器学习研究会

20+阅读 · 2017年11月5日

多层时空并行 Schwarz 算法的研究

国家自然科学基金

3+阅读 · 2017年12月31日

基于压缩感知的高精度实时视觉跟踪方法研究

国家自然科学基金

3+阅读 · 2015年12月31日

控制系统的约束矩阵方程及其高效数值算法

国家自然科学基金

0+阅读 · 2013年12月31日

基于新型小生境策略的多模态、多目标、动态进化算法的研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于贝叶斯估计理论的自主机器人双目视觉相机量化误差数据最优融合方法研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于空时频联合处理的水下远距离运动目标成像研究

国家自然科学基金

1+阅读 · 2013年12月31日

基于水下无线传感网络的目标跟踪研究

国家自然科学基金

2+阅读 · 2012年12月31日

医疗服务中的资源调度与优化

国家自然科学基金

4+阅读 · 2011年12月31日

高效生防菌Bs-916的突变菌株M49表面活性素合成能力降低的内在分子机理研究

国家自然科学基金

0+阅读 · 2009年12月31日

神经网络子空间学习算法的收敛性与鲁棒性

国家自然科学基金

1+阅读 · 2009年12月31日

GazeOnce: Real-Time Multi-Person Gaze Estimation

Arxiv

0+阅读 · 2022年4月20日

CenterNet++ for Object Detection

Arxiv

0+阅读 · 2022年4月18日

Investigating the Impact of Multi-LiDAR Placement on Object Detection for Autonomous Driving

Arxiv

0+阅读 · 2022年4月17日

Scalable and Real-time Multi-Camera Vehicle Detection, Re-Identification, and Tracking

Arxiv

0+阅读 · 2022年4月15日

FasterVideo: Efficient Online Joint Object Detection And Tracking

FasterVideo: Efficient Online Joint Object Detection And Tracking

Arxiv

0+阅读 · 2022年4月15日

Dense Learning based Semi-Supervised Object Detection

Arxiv

9+阅读 · 2022年4月15日

Knowledge Distillation for Object Detection via Rank Mimicking and Prediction-guided Feature Imitation

Arxiv

11+阅读 · 2021年12月9日

Reverse Attention for Salient Object Detection

Arxiv

11+阅读 · 2019年4月15日

Prime Sample Attention in Object Detection

Arxiv

13+阅读 · 2019年4月9日

A Robust Real-Time Automatic License Plate Recognition based on the YOLO Detector

Arxiv

13+阅读 · 2018年3月1日

VIP会员

文章信息

相关主题

相关VIP内容

【CVPR2022】自动驾驶中的伪双目三维目标检测，Pseudo-Stereo for Monocular 3D Object Detection in Autonomous Driving

【CVPR2022】自动驾驶中的伪双目三维目标检测，Pseudo-Stereo for Monocular 3D Object Detection in Autonomous Driving

专知会员服务

18+阅读 · 2022年3月19日

【CVPR 2022】实时实例分割的稀疏实例激活，Sparse Instance Activation for Real-Time Instance Segmentation

【CVPR 2022】实时实例分割的稀疏实例激活，Sparse Instance Activation for Real-Time Instance Segmentation

专知会员服务

8+阅读 · 2022年3月12日

50+篇《神经架构搜索NAS》2020论文合集

专知会员服务

61+阅读 · 2020年3月19日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

165+阅读 · 2020年3月18日

CVPR 2020 论文开源项目合集

专知会员服务

110+阅读 · 2020年3月12日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

运动物体检测与运动相机:一个全面的综述：Moving Objects Detection with a Moving Camera: A Comprehensive Review

运动物体检测与运动相机:一个全面的综述：Moving Objects Detection with a Moving Camera: A Comprehensive Review

专知会员服务

27+阅读 · 2020年1月17日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《电磁（电子）战：英国能力》最新32页报告

《美军条令：斯特赖克步兵步枪排与班作战条令》最新450页

《美海军分布式海上作战（DMO）概念：最新情况》

《跨时空与跨模态学习事件模式构建体系（LESTAT）》57页DARPA研究报告

相关资讯

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Plenary Talk1

【ICIG2021】Latest News & Announcements of the Plenary Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年11月1日

【ICIG2021】Latest News & Announcements of the Industry Talk2

【ICIG2021】Latest News & Announcements of the Industry Talk2

中国图象图形学学会CSIG

0+阅读 · 2021年7月29日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【论文推荐】最新九篇自动问答相关论文—可解释推理网络、上下文知识图谱嵌入、注意力RNN、Multi-Cast注意力网络

【论文推荐】最新九篇自动问答相关论文—可解释推理网络、上下文知识图谱嵌入、注意力RNN、Multi-Cast注意力网络

专知

15+阅读 · 2018年6月29日

ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

全球人工智能

19+阅读 · 2017年12月17日

【推荐】YOLO实时目标检测(6fps)

【推荐】YOLO实时目标检测(6fps)

机器学习研究会

20+阅读 · 2017年11月5日

相关论文

GazeOnce: Real-Time Multi-Person Gaze Estimation

Arxiv

0+阅读 · 2022年4月20日

CenterNet++ for Object Detection

Arxiv

0+阅读 · 2022年4月18日

Investigating the Impact of Multi-LiDAR Placement on Object Detection for Autonomous Driving

Arxiv

0+阅读 · 2022年4月17日

Scalable and Real-time Multi-Camera Vehicle Detection, Re-Identification, and Tracking

Arxiv

0+阅读 · 2022年4月15日

FasterVideo: Efficient Online Joint Object Detection And Tracking

FasterVideo: Efficient Online Joint Object Detection And Tracking

Arxiv

0+阅读 · 2022年4月15日

Dense Learning based Semi-Supervised Object Detection

Arxiv

9+阅读 · 2022年4月15日

Knowledge Distillation for Object Detection via Rank Mimicking and Prediction-guided Feature Imitation

Arxiv

11+阅读 · 2021年12月9日

Reverse Attention for Salient Object Detection

Arxiv

11+阅读 · 2019年4月15日

Prime Sample Attention in Object Detection

Arxiv

13+阅读 · 2019年4月9日

A Robust Real-Time Automatic License Plate Recognition based on the YOLO Detector

Arxiv

13+阅读 · 2018年3月1日

相关基金

多层时空并行 Schwarz 算法的研究

国家自然科学基金

3+阅读 · 2017年12月31日

基于压缩感知的高精度实时视觉跟踪方法研究

国家自然科学基金

3+阅读 · 2015年12月31日

控制系统的约束矩阵方程及其高效数值算法

国家自然科学基金

0+阅读 · 2013年12月31日

基于新型小生境策略的多模态、多目标、动态进化算法的研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于贝叶斯估计理论的自主机器人双目视觉相机量化误差数据最优融合方法研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于空时频联合处理的水下远距离运动目标成像研究

国家自然科学基金

1+阅读 · 2013年12月31日

基于水下无线传感网络的目标跟踪研究

国家自然科学基金

2+阅读 · 2012年12月31日

医疗服务中的资源调度与优化

国家自然科学基金

4+阅读 · 2011年12月31日

高效生防菌Bs-916的突变菌株M49表面活性素合成能力降低的内在分子机理研究

国家自然科学基金

0+阅读 · 2009年12月31日

神经网络子空间学习算法的收敛性与鲁棒性

国家自然科学基金

1+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员