As a common image editing operation, image composition aims to cut the foreground from one image and paste it on another image, resulting in a composite image. However, there are many issues that could make the composite images unrealistic. These issues can be summarized as the inconsistency between foreground and background, which includes appearance inconsistency (e.g., incompatible illumination), geometry inconsistency (e.g., unreasonable size), and semantic inconsistency (e.g., mismatched semantic context). Previous works divide image composition task into multiple sub-tasks, in which each sub-task targets at one or more issues. Specifically, object placement aims to find reasonable scale, location, and shape for the foreground. Image blending aims to address the unnatural boundary between foreground and background. Image harmonization aims to adjust the illumination statistics of foreground. Shadow generation aims to generate plausible shadow for the foreground. By putting all the abovementioned efforts together, we can acquire realistic composite images. To the best of our knowledge, there is no previous survey on image composition. In this paper, we conduct comprehensive survey over the sub-tasks of image composition. For each sub-task, we summarize the traditional methods, deep learning based methods, datasets and evaluation. We also point out the limitations of existing methods in each sub-task and the problem of the whole image composition task. Datasets and codes for image composition are summarized at https://github.com/bcmi/Awesome-Image-Composition.
翻译:作为共同的图像编辑操作,图像的构成旨在将图像的表面从一个图像上切开,将其粘贴在另一个图像上,从而形成一个复合图像。然而,有许多问题可能会使复合图像不现实。这些问题可以被概括为前景与背景之间的不一致,包括表面与背景之间的不一致(例如不相容的照明)、几何不一致(例如不合理的尺寸)和语义不一致(例如,不相配的语义背景)。以前的作品将图像构成任务分为多个子任务,每个子任务都包含在一个或多个问题上。具体地说,对象放置的目的是为背景找到合理的规模、位置和形状。图像混合的目的是解决背景和背景之间的非自然界限。图像统一的目的是调整前景的不相容统计数据。影子生成的目的是为背景创造可信的阴影。通过将所有上述努力,我们可以获得符合现实的复合图像。我们最了解的图像构成情况是以前的调查。在本文中,我们对每个子任务、地点和背景的图像构成进行全面的调查,我们在每个子任务中进行基于图像构成的子任务、我们基于数据总结的分类和亚方法。