We propose YolactEdge, the first competitive instance segmentation approach that runs on small edge devices at real-time speeds. Specifically, YolactEdge runs at up to 30.8 FPS on a Jetson AGX Xavier (and 172.7 FPS on an RTX 2080 Ti) with a ResNet-101 backbone on 550x550 resolution images. To achieve this, we make two improvements to the state-of-the-art image-based real-time method YOLACT: (1) applying TensorRT optimization while carefully trading off speed and accuracy, and (2) a novel feature warping module that exploits temporal redundancy in videos. Experiments on the YouTube VIS and MS COCO datasets demonstrate that YolactEdge yields a 3-5x speedup over existing real-time methods while maintaining competitive mask and box detection accuracy. We also conduct ablation studies to dissect our design choices and modules. Code and models are available at https://github.com/haotian-liu/yolact_edge.
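To make the second improvement concrete, the sketch below illustrates the general idea of flow-guided feature warping: backbone features computed on a previous "key" frame are resampled into the current frame's coordinates using an estimated optical flow, so the expensive backbone need not be re-run on every frame. This is a minimal illustration of the technique, not the authors' implementation; the tensor shapes, the `warp_features` helper, and the zero-flow usage example are assumptions for demonstration, and only standard PyTorch operations (`torch.meshgrid`, `F.grid_sample`) are used.

```python
# Minimal sketch of flow-based feature warping (not the YolactEdge module itself).
import torch
import torch.nn.functional as F

def warp_features(prev_feat: torch.Tensor, flow: torch.Tensor) -> torch.Tensor:
    """Warp prev_feat (N, C, H, W) from a key frame into the current frame
    using flow (N, 2, H, W), where flow[:, 0] / flow[:, 1] are horizontal /
    vertical pixel displacements."""
    n, _, h, w = prev_feat.shape
    # Base sampling grid of pixel coordinates (x, y).
    ys, xs = torch.meshgrid(
        torch.arange(h, dtype=prev_feat.dtype),
        torch.arange(w, dtype=prev_feat.dtype),
        indexing="ij",
    )
    base = torch.stack((xs, ys), dim=0).unsqueeze(0).to(prev_feat.device)  # (1, 2, H, W)
    # Displace the grid by the flow and normalize to [-1, 1] for grid_sample.
    coords = base + flow
    coords_x = 2.0 * coords[:, 0] / max(w - 1, 1) - 1.0
    coords_y = 2.0 * coords[:, 1] / max(h - 1, 1) - 1.0
    grid = torch.stack((coords_x, coords_y), dim=-1)  # (N, H, W, 2)
    return F.grid_sample(prev_feat, grid, mode="bilinear", align_corners=True)

# Hypothetical usage: reuse a key-frame feature map instead of recomputing it.
prev_feat = torch.randn(1, 256, 69, 69)  # e.g. one pyramid level from the key frame
flow = torch.zeros(1, 2, 69, 69)         # zero flow -> identity warp
curr_feat = warp_features(prev_feat, flow)
```

In this formulation, only a lightweight flow estimator runs on non-key frames, which is what allows temporal redundancy to translate into wall-clock savings on an edge device.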