PV3D: A 3D Generative Model for Portrait Video Generation - 专知 Papers

GANs · MoDELS · 3D · Generative models · HTTPS

February 1, 2023

PV3D: A 3D Generative Model for Portrait Video Generation

Eric Zhongcong Xu,Jianfeng Zhang,Jun Hao Liew,Wenqing Zhang,Song Bai,Jiashi Feng,Mike Zheng Shou

From arXiv. Accepted to ICLR 2023. Project page: https://showlab.github.io/pv3d

Recent advances in generative adversarial networks (GANs) have demonstrated the capabilities of generating stunning photo-realistic portrait images. While some prior works have applied such image GANs to unconditional 2D portrait video generation and static 3D portrait synthesis, there are few works successfully extending GANs for generating 3D-aware portrait videos. In this work, we propose PV3D, the first generative framework that can synthesize multi-view consistent portrait videos. Specifically, our method extends the recent static 3D-aware image GAN to the video domain by generalizing the 3D implicit neural representation to model the spatio-temporal space. To introduce motion dynamics to the generation process, we develop a motion generator by stacking multiple motion layers to generate motion features via modulated convolution. To alleviate motion ambiguities caused by camera/human motions, we propose a simple yet effective camera condition strategy for PV3D, enabling both temporal and multi-view consistent video generation. Moreover, PV3D introduces two discriminators for regularizing the spatial and temporal domains to ensure the plausibility of the generated portrait videos. These elaborated designs enable PV3D to generate 3D-aware motion-plausible portrait videos with high-quality appearance and geometry, significantly outperforming prior works. As a result, PV3D is able to support many downstream applications such as animating static portraits and view-consistent video motion editing. Code and models are released at https://showlab.github.io/pv3d.
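The abstract mentions a motion generator built by stacking motion layers that produce features via modulated convolution. As a rough illustration only, the sketch below shows the general weight modulation and demodulation idea popularized by StyleGAN2, reduced to a 1x1 kernel at a single spatial location. It is not the PV3D implementation: the function names (`modulated_linear`, `motion_generator`), the affine mapping from a motion code to per-channel styles, the leaky ReLU activation, and all shapes are assumptions made for this example.

```python
import math


def modulated_linear(x, weight, style, eps=1e-8):
    """1x1 modulated convolution at one spatial location (illustrative).

    x:      list of in_ch floats (input feature vector)
    weight: out_ch x in_ch nested list (kernel shared across samples)
    style:  list of in_ch floats (per-sample scales)
    """
    out = []
    for row in weight:
        # Modulate: scale each input channel of the kernel by its style.
        w_mod = [w * s for w, s in zip(row, style)]
        # Demodulate: normalize so each output channel's weights have unit L2 norm.
        norm = math.sqrt(sum(w * w for w in w_mod) + eps)
        out.append(sum((w / norm) * xi for w, xi in zip(w_mod, x)))
    return out


def leaky_relu(values, slope=0.2):
    return [v if v > 0 else slope * v for v in values]


def motion_generator(feature, motion_code, layers):
    """Hypothetical stack of motion layers; each layer is (affine, weight).

    affine: in_ch x motion_dim nested list mapping the motion code to styles
    weight: out_ch x in_ch nested list used by the modulated convolution
    """
    h = feature
    for affine, weight in layers:
        # Assumed affine map from the motion code to styles, centered at 1.
        style = [1.0 + sum(a * m for a, m in zip(row, motion_code))
                 for row in affine]
        h = leaky_relu(modulated_linear(h, weight, style))
    return h
```

Calling `motion_generator(feature, motion_code, layers)` with a list of `(affine, weight)` pairs returns a feature vector whose length equals the number of rows in the last layer's `weight`. Because of the demodulation step, scaling every style value by the same positive factor leaves the output of `modulated_linear` essentially unchanged (up to `eps`), which is the property that keeps feature magnitudes stable across stacked layers.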
