We present a novel approach to single-view face relighting in the wild. Handling non-diffuse effects, such as global illumination or cast shadows, has long been a challenge in face relighting. Prior work often assumes Lambertian surfaces or simplified lighting models, or involves estimating 3D shape, albedo, or a shadow map. Such estimation, however, is error-prone and requires many training examples with lighting ground truth to generalize well. Our work bypasses the need for accurate estimation of intrinsic components and can be trained solely on 2D images without any light stage data, multi-view images, or lighting ground truth. Our key idea is to leverage a conditional diffusion implicit model (DDIM) for decoding a disentangled light encoding along with other encodings related to 3D shape and facial identity inferred from off-the-shelf estimators. We also propose a novel conditioning technique that eases the modeling of the complex interaction between light and geometry by using a rendered shading reference to spatially modulate the DDIM. We achieve state-of-the-art performance on the standard Multi-PIE benchmark and can photorealistically relight in-the-wild images. Please visit our page: https://diffusion-face-relighting.github.io
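The spatial modulation mentioned above can be illustrated with a minimal sketch. This is not the authors' code: it assumes a SPADE-style scale-and-shift conditioning in which per-pixel scale (`gamma`) and shift (`beta`) maps are predicted from the rendered shading reference (here via a 1x1 convolution, implemented as a per-channel linear map) and applied to an intermediate DDIM feature map. All names, shapes, and the single-layer predictor are illustrative assumptions.

```python
# Hedged sketch of spatial modulation by a shading reference (not the paper's
# implementation). A 1x1 "conv" maps the 1-channel shading map to C-channel
# scale/shift maps, which then modulate a (C, H, W) feature map per pixel.
import numpy as np

def spatial_modulate(h, shading, w_gamma, w_beta):
    """Modulate features h (C, H, W) with scale/shift maps predicted
    from a shading reference (1, H, W) via per-channel linear weights."""
    gamma = w_gamma[:, None, None] * shading  # (C, H, W) scale map
    beta = w_beta[:, None, None] * shading    # (C, H, W) shift map
    # Residual-style modulation: zero weights leave h unchanged.
    return h * (1.0 + gamma) + beta

rng = np.random.default_rng(0)
C, H, W = 8, 16, 16
h = rng.standard_normal((C, H, W))       # hypothetical DDIM feature map
shading = rng.random((1, H, W))          # rendered shading reference
out = spatial_modulate(h, shading, rng.standard_normal(C), rng.standard_normal(C))
print(out.shape)  # (8, 16, 16)
```

Conditioning via per-pixel modulation (rather than, say, concatenation) lets the lighting signal rescale features locally, which is a natural fit for shading, whose effect on appearance is multiplicative and spatially varying.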