完全光光:完全有条件的光光光,用于更现实的图像生成 (Full-Glow: Fully conditional Glow for more realistic image generation)

Autonomous agents, such as driverless cars, require large amounts of labeled visual data for their training. A viable approach for acquiring such data is training a generative model with collected real data, and then augmenting the collected real dataset with synthetic images from the model, generated with control of the scene layout and ground truth labeling. In this paper we propose Full-Glow, a fully conditional Glow-based architecture for generating plausible and realistic images of novel street scenes given a semantic segmentation map indicating the scene layout. Benchmark comparisons show our model to outperform recent works in terms of the semantic segmentation performance of a pretrained PSPNet. This indicates that images from our model are, to a higher degree than from other models, similar to real images of the same kinds of scenes and objects, making them suitable as training data for a visual semantic segmentation or object recognition system.

翻译：无司机汽车等自主代理商的培训需要大量贴有标签的视觉数据。获取这些数据的一个可行办法是用收集到的真实数据来培训一种基因模型,然后用模型合成图像来补充收集到的真实数据集,这些图像是用模型的合成图像生成的,通过控制场景布局和地面真相标签生成的。在本文中,我们提议一个完全有条件的Glow结构,即完全有条件的Glow结构,用于生成新颖街道景象的可信和现实图像,并配有显示场景布局的语义分割图。基准比较表明,我们的模型在事先训练过的PSPNet的语义分割性表现方面比最近的工作要优。这表明,我们的模型图像与其他模型相比,在更高程度上类似于同类场景和物体的真实图像,因此适合用于视觉语义分割或对象识别系统的培训数据。

相关内容

MoDELS

关注 43

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/