Semantic segmentation is the pixel-wise labelling of an image. Since the problem is defined at the pixel level, determining image class labels only is not acceptable, but localising them at the original image pixel resolution is necessary. Boosted by the extraordinary ability of convolutional neural networks (CNN) in creating semantic, high level and hierarchical image features; several deep learning-based 2D semantic segmentation approaches have been proposed within the last decade. In this survey, we mainly focus on the recent scientific developments in semantic segmentation, specifically on deep learning-based methods using 2D images. We started with an analysis of the public image sets and leaderboards for 2D semantic segmentation, with an overview of the techniques employed in performance evaluation. In examining the evolution of the field, we chronologically categorised the approaches into three main periods, namely pre-and early deep learning era, the fully convolutional era, and the post-FCN era. We technically analysed the solutions put forward in terms of solving the fundamental problems of the field, such as fine-grained localisation and scale invariance. Before drawing our conclusions, we present a table of methods from all mentioned eras, with a summary of each approach that explains their contribution to the field. We conclude the survey by discussing the current challenges of the field and to what extent they have been solved.
翻译:语义分解是图像的像素标签。 由于在像素层次上定义了问题, 确定图像类标签是不可接受的, 确定图像类标签只能以原始图像像素解解析方式进行本地化是有必要的。 由进化神经网络(CNN)在创建语义、 高层次和等级图像特征方面的超乎寻常能力所推动的; 在过去十年中提出了若干基于深层次学习的 2D 语义分解方法。 在本次调查中, 我们主要关注语义分解的最新科学发展, 特别是使用 2D 图像的深层次学习法。 我们首先分析2D 语义分解的公共图像集和领导板, 并概述在绩效评估中所使用的技术。 在考察实地的演进过程时, 我们按时间将各种方法, 即早期的深层次学习时代, 完全革命时代, 以及FCN 后时代。 我们从技术上分析了在解决本领域基本问题方面所提出的解决办法, 例如精确的地域分化和规模, 之前我们从目前的表格 来解释这些地方化和规模的结论。