What is really needed to make an existing 2D GAN 3D-aware? To answer this question, we modify a classical GAN, namely StyleGANv2, as little as possible. We find that only two modifications are absolutely necessary: 1) a multiplane image style generator branch which produces a set of alpha maps conditioned on their depth; 2) a pose-conditioned discriminator. We refer to the generated output as a 'generative multiplane image' (GMPI) and emphasize that its renderings are not only high-quality but also guaranteed to be view-consistent, which makes GMPIs different from many prior works. Importantly, the number of alpha maps can be dynamically adjusted and can differ between training and inference, alleviating memory concerns and enabling fast training of GMPIs in less than half a day at a resolution of $1024^2$. Our findings are consistent across three challenging and common high-resolution datasets: FFHQ, AFHQv2, and MetFaces.
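The view-consistency guarantee stems from the rendering model rather than from the network: every target view is produced by warping and alpha-compositing the same fixed stack of fronto-parallel planes. The sketch below illustrates the standard MPI "over" compositing step only; it is not the authors' implementation. The function name `composite_mpi`, the tensor shapes, and the assumption that each plane's color and alpha have already been warped into the target camera (e.g., via a planar homography) are illustrative choices.

```python
import torch

def composite_mpi(colors: torch.Tensor, alphas: torch.Tensor) -> torch.Tensor:
    """Front-to-back alpha compositing of a multiplane image.

    colors: (B, L, 3, H, W)  per-plane RGB, already warped to the target view
    alphas: (B, L, 1, H, W)  per-plane alpha maps; plane 0 is the nearest
    returns (B, 3, H, W)     the rendered image
    """
    # Transmittance before plane i: T_i = prod_{j < i} (1 - alpha_j),
    # with T_0 = 1 for the nearest plane.
    ones = torch.ones_like(alphas[:, :1])
    transmittance = torch.cumprod(
        torch.cat([ones, 1.0 - alphas[:, :-1]], dim=1), dim=1
    )
    # Per-plane contribution weights w_i = alpha_i * T_i.
    weights = alphas * transmittance          # (B, L, 1, H, W)
    # Weighted sum over the plane dimension yields the composite.
    return (colors * weights).sum(dim=1)      # broadcasts over RGB channels

# Hypothetical usage: the plane count L is only a tensor dimension here,
# consistent with the abstract's point that the number of alpha maps can
# differ between training and inference.
B, L, H, W = 2, 32, 256, 256
colors = torch.rand(B, L, 3, H, W)
alphas = torch.rand(B, L, 1, H, W)
image = composite_mpi(colors, alphas)         # (B, 3, H, W)
```

Because any target view is obtained by re-warping and compositing the same generated planes, two renderings of one GMPI cannot disagree about scene content; consistency is a property of this renderer, not something the discriminator must enforce.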