(TensorFlow)实时语义分割比较研究

2018 年 3 月 12 日 机器学习研究会
(TensorFlow)实时语义分割比较研究
                                                                                                                                                                                 
点击上方 “机器学习研究会”可以订阅
摘要
 

转自:爱可可-爱生活

论文《RTSeg: Real-time Semantic Segmentation Comparative Study》摘要:

Semantic segmentation benefits robotics related applications especially autonomous driving. Most of the research on semantic segmentation is only on increasing the accuracy of segmentation models with little attention to computationally efficient solutions. The few work conducted in this direction does not provide principled methods to evaluate the different design choices for segmentation. In this paper, we address this gap by presenting a real-time semantic segmentation benchmarking framework with a decoupled design for feature extraction and decoding methods. The framework is comprised of different network architectures for feature extraction such as VGG16, Resnet18, MobileNet, and ShuffleNet. It is also comprised of multiple meta-architectures for segmentation that define the decoding methodology. These include SkipNet, UNet, and Dilation Frontend. Experimental results are presented on the Cityscapes dataset for urban scenes. The modular design allows novel architectures to emerge, that lead to 143x GFLOPs reduction in comparison to SegNet. This benchmarking framework will be made publicly available at 1 .

The repository contains the official TensorFlow code used in the our paper RTSEG: REAL-TIME SEMANTIC SEGMENTATION COMPARATIVE STUDY for comparing different realtime semantic segmentation architectures.

Description

Semantic segmentation benefits robotics related applications especially autonomous driving. Most of the research on semantic segmentation is only on increasing the accuracy of segmentation models with little attention to computationally efficient solutions. The few work conducted in this direction does not provide principled methods to evaluate the different design choices for segmentation. In this paper, we address this gap by presenting a real-time semantic segmentation benchmarking framework with a decoupled design for feature extraction and decoding methods. The code and the experimental results are presented on the CityScapes dataset for urban scenes.


论文链接:

https://www.arxiv-vanity.com/papers/1803.02758/


代码链接:

https://github.com/MSiam/TFSegmentation


原文链接:

https://m.weibo.cn/1402400261/4216793734309974

“完整内容”请点击【阅读原文】
↓↓↓
登录查看更多
9

相关内容

语义分割,在机器学习上,多指对一段文本或者一张图片,提取其中有意义的部分,我们将这些有意义的部分称为语义单元,将这些语义单元提取出来的过程,称为语义分割。

Existing Earth Vision datasets are either suitable for semantic segmentation or object detection. In this work, we introduce the first benchmark dataset for instance segmentation in aerial imagery that combines instance-level object detection and pixel-level segmentation tasks. In comparison to instance segmentation in natural scenes, aerial images present unique challenges e.g., a huge number of instances per image, large object-scale variations and abundant tiny objects. Our large-scale and densely annotated Instance Segmentation in Aerial Images Dataset (iSAID) comes with 655,451 object instances for 15 categories across 2,806 high-resolution images. Such precise per-pixel annotations for each instance ensure accurate localization that is essential for detailed scene analysis. Compared to existing small-scale aerial image based instance segmentation datasets, iSAID contains 15$\times$ the number of object categories and 5$\times$ the number of instances. We benchmark our dataset using two popular instance segmentation approaches for natural images, namely Mask R-CNN and PANet. In our experiments we show that direct application of off-the-shelf Mask R-CNN and PANet on aerial images provide suboptimal instance segmentation results, thus requiring specialized solutions from the research community. The dataset is publicly available at: https://captain-whu.github.io/iSAID/index.html

0
7
下载
预览

Semantic segmentation requires both rich spatial information and sizeable receptive field. However, modern approaches usually compromise spatial resolution to achieve real-time inference speed, which leads to poor performance. In this paper, we address this dilemma with a novel Bilateral Segmentation Network (BiSeNet). We first design a Spatial Path with a small stride to preserve the spatial information and generate high-resolution features. Meanwhile, a Context Path with a fast downsampling strategy is employed to obtain sufficient receptive field. On top of the two paths, we introduce a new Feature Fusion Module to combine features efficiently. The proposed architecture makes a right balance between the speed and segmentation performance on Cityscapes, CamVid, and COCO-Stuff datasets. Specifically, for a 2048x1024 input, we achieve 68.4% Mean IOU on the Cityscapes test dataset with speed of 105 FPS on one NVIDIA Titan XP card, which is significantly faster than the existing methods with comparable performance.

0
4
下载
预览

In this work, we evaluate the use of superpixel pooling layers in deep network architectures for semantic segmentation. Superpixel pooling is a flexible and efficient replacement for other pooling strategies that incorporates spatial prior information. We propose a simple and efficient GPU-implementation of the layer and explore several designs for the integration of the layer into existing network architectures. We provide experimental results on the IBSR and Cityscapes dataset, demonstrating that superpixel pooling can be leveraged to consistently increase network accuracy with minimal computational overhead. Source code is available at https://github.com/bermanmaxim/superpixPool

0
5
下载
预览

We present a conceptually simple, flexible, and general framework for object instance segmentation. Our approach efficiently detects objects in an image while simultaneously generating a high-quality segmentation mask for each instance. The method, called Mask R-CNN, extends Faster R-CNN by adding a branch for predicting an object mask in parallel with the existing branch for bounding box recognition. Mask R-CNN is simple to train and adds only a small overhead to Faster R-CNN, running at 5 fps. Moreover, Mask R-CNN is easy to generalize to other tasks, e.g., allowing us to estimate human poses in the same framework. We show top results in all three tracks of the COCO suite of challenges, including instance segmentation, bounding-box object detection, and person keypoint detection. Without bells and whistles, Mask R-CNN outperforms all existing, single-model entries on every task, including the COCO 2016 challenge winners. We hope our simple and effective approach will serve as a solid baseline and help ease future research in instance-level recognition. Code has been made available at: https://github.com/facebookresearch/Detectron

0
7
下载
预览
小贴士
相关资讯
Github项目推荐 | AutoML与轻量模型列表
AI研习社
9+阅读 · 2019年5月4日
弱监督语义分割最新方法资源列表
专知
7+阅读 · 2019年2月26日
TorchSeg:基于pytorch的语义分割算法开源了
极市平台
19+阅读 · 2019年1月28日
语义分割+视频分割开源代码集合
极市平台
35+阅读 · 2018年3月5日
【推荐】(TensorFlow)SSD实时手部检测与追踪(附代码)
机器学习研究会
8+阅读 · 2017年12月5日
【推荐】用Python/OpenCV实现增强现实
机器学习研究会
5+阅读 · 2017年11月16日
【推荐】YOLO实时目标检测(6fps)
机器学习研究会
16+阅读 · 2017年11月5日
【推荐】卷积神经网络类间不平衡问题系统研究
机器学习研究会
6+阅读 · 2017年10月18日
【推荐】全卷积语义分割综述
机器学习研究会
17+阅读 · 2017年8月31日
相关VIP内容
专知会员服务
38+阅读 · 2020年6月20日
专知会员服务
41+阅读 · 2020年3月19日
专知会员服务
97+阅读 · 2020年3月12日
《强化学习—使用 Open AI、TensorFlow和Keras实现》174页pdf
专知会员服务
97+阅读 · 2020年3月1日
专知会员服务
42+阅读 · 2020年2月26日
2019年机器学习框架回顾
专知会员服务
25+阅读 · 2019年10月11日
[综述]深度学习下的场景文本检测与识别
专知会员服务
41+阅读 · 2019年10月10日
计算机视觉最佳实践、代码示例和相关文档
专知会员服务
9+阅读 · 2019年10月9日
TensorFlow 2.0 学习资源汇总
专知会员服务
41+阅读 · 2019年10月9日
【哈佛大学商学院课程Fall 2019】机器学习可解释性
专知会员服务
54+阅读 · 2019年10月9日
相关论文
Learning Dynamic Routing for Semantic Segmentation
Yanwei Li,Lin Song,Yukang Chen,Zeming Li,Xiangyu Zhang,Xingang Wang,Jian Sun
5+阅读 · 2020年3月23日
iSAID: A Large-scale Dataset for Instance Segmentation in Aerial Images
Syed Waqas Zamir,Aditya Arora,Akshita Gupta,Salman Khan,Guolei Sun,Fahad Shahbaz Khan,Fan Zhu,Ling Shao,Gui-Song Xia,Xiang Bai
7+阅读 · 2019年8月28日
S4Net: Single Stage Salient-Instance Segmentation
Ruochen Fan,Ming-Ming Cheng,Qibin Hou,Tai-Jiang Mu,Jingdong Wang,Shi-Min Hu
7+阅读 · 2019年4月10日
Auto-DeepLab: Hierarchical Neural Architecture Search for Semantic Image Segmentation
Chenxi Liu,Liang-Chieh Chen,Florian Schroff,Hartwig Adam,Wei Hua,Alan Yuille,Li Fei-Fei
5+阅读 · 2019年1月10日
Juntang Zhuang,Junlin Yang
7+阅读 · 2018年12月10日
Ken C. L. Wong,Tanveer Syeda-Mahmood,Mehdi Moradi
4+阅读 · 2018年8月15日
BiSeNet: Bilateral Segmentation Network for Real-time Semantic Segmentation
Changqian Yu,Jingbo Wang,Chao Peng,Changxin Gao,Gang Yu,Nong Sang
4+阅读 · 2018年8月2日
Automatically Designing CNN Architectures for Medical Image Segmentation
Aliasghar Mortazi,Ulas Bagci
9+阅读 · 2018年7月19日
Mathijs Schuurmans,Maxim Berman,Matthew B. Blaschko
5+阅读 · 2018年6月7日
Kaiming He,Georgia Gkioxari,Piotr Dollár,Ross Girshick
7+阅读 · 2018年1月24日
Top