Scene Aware Person Image Generation through Global Contextual Conditioning

Person image generation is an intriguing yet challenging problem, and it becomes even more difficult under constrained situations. In this work, we propose a novel pipeline to generate and insert contextually relevant person images into an existing scene while preserving the global semantics. More specifically, we aim to insert a person such that the location, pose, and scale of the inserted person blend in with the persons already present in the scene. Our method uses three individual networks in a sequential pipeline. First, we predict the potential location and the skeletal structure of the new person by conditioning a Wasserstein Generative Adversarial Network (WGAN) on the existing human skeletons present in the scene. Next, the predicted skeleton is refined through a shallow linear network to achieve higher structural accuracy in the generated image. Finally, the target image is generated from the refined skeleton using another generative network conditioned on a given image of the target person. In our experiments, we achieve high-resolution photo-realistic generation results while preserving the general context of the scene. We conclude the paper with multiple qualitative and quantitative benchmarks on the results.
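The three-stage data flow described above can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the function names, the keypoint representation, and the stage bodies are hypothetical stand-ins, not the authors' networks. In particular, stage 1 replaces the conditional WGAN with a jointwise average plus random offset, stage 2 replaces the shallow linear network with a fixed affine map, and stage 3 returns a description instead of rendering an image.

```python
import random
from typing import Dict, List, Tuple

Keypoint = Tuple[float, float]  # (x, y) in normalized image coordinates (assumed)
Skeleton = List[Keypoint]


def predict_skeleton(scene: List[Skeleton], rng: random.Random) -> Skeleton:
    """Stage 1 stand-in (hypothetical). The paper conditions a WGAN on the
    scene's skeletons; here we average them jointwise and add one random shift."""
    n_joints = len(scene[0])
    dx, dy = rng.uniform(-0.1, 0.1), rng.uniform(-0.1, 0.1)
    return [
        (
            sum(s[j][0] for s in scene) / len(scene) + dx,
            sum(s[j][1] for s in scene) / len(scene) + dy,
        )
        for j in range(n_joints)
    ]


def refine_skeleton(skel: Skeleton, weight: float = 1.0, bias: float = 0.0) -> Skeleton:
    """Stage 2 stand-in (hypothetical): a per-coordinate affine map,
    clamped to the unit square, in place of a learned linear network."""
    def clamp(v: float) -> float:
        return min(1.0, max(0.0, v))
    return [(clamp(weight * x + bias), clamp(weight * y + bias)) for x, y in skel]


def render_person(skel: Skeleton, target_image_id: str) -> Dict[str, object]:
    """Stage 3 stand-in (hypothetical): describes what a generator conditioned
    on the target person's image would render, without producing pixels."""
    return {"appearance_from": target_image_id, "pose": skel}


def insert_person(scene: List[Skeleton], target_image_id: str, seed: int = 0) -> Dict[str, object]:
    """Run the three stages in sequence: predict, refine, render."""
    rng = random.Random(seed)
    coarse = predict_skeleton(scene, rng)
    refined = refine_skeleton(coarse)
    return render_person(refined, target_image_id)


if __name__ == "__main__":
    # Two existing people, each reduced to two joints for brevity.
    scene = [[(0.2, 0.3), (0.25, 0.5)], [(0.6, 0.3), (0.65, 0.5)]]
    print(insert_person(scene, "target.jpg"))
```

The point of the sketch is only the ordering: the new skeleton depends on the existing skeletons in the scene, refinement operates on the skeleton alone, and the appearance of the target person enters only at the final stage.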

