语义图象分割:两个研究十年 (Semantic Image Segmentation: Two Decades of Research)

from arxiv, Pre-print of the book: G. Csurka, R. Volpi and B. Chidlovski: Semantic Image Segmentation: Two Decades of Research, FTCGV (14): No. 1-2, http://dx.doi.org/10.1561/0600000095. The authors retained the copyright and are allowed to post it on arXiv. Research only use, commercial use or systematic downloading (by robots or other automatic processes) is prohibited

Semantic image segmentation (SiS) plays a fundamental role in a broad variety of computer vision applications, providing key information for the global understanding of an image. This survey is an effort to summarize two decades of research in the field of SiS, where we propose a literature review of solutions starting from early historical methods followed by an overview of more recent deep learning methods including the latest trend of using transformers. We complement the review by discussing particular cases of the weak supervision and side machine learning techniques that can be used to improve the semantic segmentation such as curriculum, incremental or self-supervised learning. State-of-the-art SiS models rely on a large amount of annotated samples, which are more expensive to obtain than labels for tasks such as image classification. Since unlabeled data is instead significantly cheaper to obtain, it is not surprising that Unsupervised Domain Adaptation (UDA) reached a broad success within the semantic segmentation community. Therefore, a second core contribution of this book is to summarize five years of a rapidly growing field, Domain Adaptation for Semantic Image Segmentation (DASiS) which embraces the importance of semantic segmentation itself and a critical need of adapting segmentation models to new environments. In addition to providing a comprehensive survey on DASiS techniques, we unveil also newer trends such as multi-domain learning, domain generalization, domain incremental learning, test-time adaptation and source-free domain adaptation. Finally, we conclude this survey by describing datasets and benchmarks most widely used in SiS and DASiS and briefly discuss related tasks such as instance and panoptic image segmentation, as well as applications such as medical image segmentation.

翻译：语义图像分解 (SiS) 在一系列广泛的计算机图像应用中发挥着根本作用, 提供了关键信息, 供全球理解图像。本调查旨在总结Sis领域20年的研究, 我们建议从早期历史方法开始对解决方案进行文献审查, 然后从早期历史方法开始对最新的深层学习方法, 包括使用变压器的最新趋势进行概览。我们通过讨论可用于改进语义解解解析的特定案例, 比如课程、递增或自我监督的学习等。州- 艺术SisS 模型依赖于大量的附加标注的样本, 这比图像分类等任务标签要贵得多。由于未加贴标签的数据要便宜得多, 我们并不奇怪, 未加固的 Domain Adis 适应(UDA) 在语义分解界中取得了广泛的成功。因此, 这本书的第二个核心贡献是总结一个快速增长的字段, Domainal addition for Semantical discritionation (DS) subly subilate real real ditionalational distration ladistration (D) (DSie) ladistration