SpetcherEdit: 无遮罩的本地图像使用部分策略处理 (SketchEdit: Mask-Free Local Image Manipulation with Partial Sketches)

Sketch-based image manipulation is an interactive image editing task to modify an image based on input sketches from users. Existing methods typically formulate this task as a conditional inpainting problem, which requires users to draw an extra mask indicating the region to modify in addition to sketches. The masked regions are regarded as holes and filled by an inpainting model conditioned on the sketch. With this formulation, paired training data can be easily obtained by randomly creating masks and extracting edges or contours. Although this setup simplifies data preparation and model design, it complicates user interaction and discards useful information in masked regions. To this end, we investigate a new paradigm of sketch-based image manipulation: mask-free local image manipulation, which only requires sketch inputs from users and utilizes the entire original image. Given an image and sketch, our model automatically predicts the target modification region and encodes it into a structure agnostic style vector. A generator then synthesizes the new image content based on the style vector and sketch. The manipulated image is finally produced by blending the generator output into the modification region of the original image. Our model can be trained in a self-supervised fashion by learning the reconstruction of an image region from the style vector and sketch. The proposed method offers simpler and more intuitive user workflows for sketch-based image manipulation and provides better results than previous approaches. More results, code and interactive demo will be available at \url{https://zengxianyu.github.io/sketchedit}.

翻译：基于 Scletch 的图像操纵是一种交互式图像编辑任务, 目的是根据用户输入的草图修改图像。现有方法通常将此项任务设计成一个有条件的油漆问题, 要求用户在草图之外再绘制一个额外的遮罩, 显示区域修改。遮蔽区域被视为洞, 由素描上的一个涂鸦模型来填充。有了这个配对培训数据, 可以通过随机创建遮罩和提取边框或轮廓来轻易获得。虽然这个设置简化了数据编制和模型设计, 但它使用户互动和丢弃了在遮蔽区域有用的信息。为此, 我们调查基于素描图的图像操纵新模式: 无遮罩本地图像操作, 只需要用户的草图输入草图, 并使用全部原始图像的油漆模型和草图。我们的模型自动预测目标修改区域, 并将它编成结构风格矢量矢量矢量矢量矢量矢量矢量矢量和素设计。操纵图像最终通过将发电机输出输出输出输出输出输出到原始图像的修改区域。我们经过培训的模型将提供一个更精确的版本, 将提供一个更精确的矢量矢量图像的模型, 将提供更精确的模型, 将提供更精确的版本的版本。将提供更精确的版本的版本的版本的版本的图像的模型, 。将提供一个的模型将提供一个模型将提供更精确的图像。将提供更精确的模型将提供更精确的图像。