Image matting refers to extracting precise alpha matte from natural images, and it plays a critical role in various downstream applications, such as image editing. Despite being an ill-posed problem, traditional methods have been trying to solve it for decades. The emergence of deep learning has revolutionized the field of image matting and given birth to multiple new techniques, including automatic, interactive, and referring image matting. This paper presents a comprehensive review of recent advancements in image matting in the era of deep learning. We focus on two fundamental sub-tasks: auxiliary input-based image matting, which involves user-defined input to predict the alpha matte, and automatic image matting, which generates results without any manual intervention. We systematically review the existing methods for these two tasks according to their task settings and network structures and provide a summary of their advantages and disadvantages. Furthermore, we introduce the commonly used image matting datasets and evaluate the performance of representative matting methods both quantitatively and qualitatively. Finally, we discuss relevant applications of image matting and highlight existing challenges and potential opportunities for future research. We also maintain a public repository to track the rapid development of deep image matting at https://github.com/JizhiziLi/matting-survey.
翻译:图像抠图是从自然图像中精确提取 alpha 通道的过程,它在诸多下游应用中扮演着关键角色,例如图像编辑。虽然它是一个逆问题,但是传统方法已经在这方面进行了数十年的尝试。深度学习的出现彻底颠覆了图像抠图领域,孕育出了多种新技术,包括自动抠图、交互抠图和参考抠图。本文对深度学习时代图像抠图领域中的最近进展进行了全面回顾。我们将重点关注两个基本子任务:基于辅助输入的图像抠图,这涉及用户定义输入来预测 alpha 通道;以及自动图像抠图,这生成不需要任何手动干预的结果。我们按照任务设置和网络结构系统地回顾了这两个任务的现有方法,并总结了它们的优缺点。此外,我们介绍了常用的图像抠图数据集,定量和定性地评估了代表抠图方法的性能。最后,我们讨论了图像抠图的相关应用,强调了现有的挑战和未来研究的潜在机遇。我们还维护了一个公共代码库,以跟踪深度图像抠图的快速发展,网址为 https://github.com/JizhiziLi/matting-survey。