DAG: 与Disoising扩散概率模型的深度软件指导 (DAG: Depth-Aware Guidance with Denoising Diffusion Probabilistic Models) - 专知论文

会员服务 ·

0

Guidance · MoDELS · 估计/估计量 · DAG · 去噪 ·

2023 年 1 月 30 日

DAG: Depth-Aware Guidance with Denoising Diffusion Probabilistic Models

翻译：DAG: 与Disoising扩散概率模型的深度软件指导

Gyeongnyeon Kim,Wooseok Jang,Gyuseong Lee,Susung Hong,Junyoung Seo,Seungryong Kim

from arxiv, Project page is available at https://ku-cvlab.github.io/DAG/

Generative models have recently undergone significant advancement due to the diffusion models. The success of these models can be often attributed to their use of guidance techniques, such as classifier or classifier-free guidance, which provide effective mechanisms to trade-off between fidelity and diversity. However, these methods are not capable of guiding a generated image to be aware of its geometric configuration, e.g., depth, which hinders their application to areas that require a certain level of depth awareness. To address this limitation, we propose a novel guidance method for diffusion models that uses estimated depth information derived from the rich intermediate representations of diffusion models. We first present label-efficient depth estimation framework using internal representations of diffusion models. Subsequently, we propose the incorporation of two guidance techniques based on pseudo-labeling and depth-domain diffusion prior during the sampling phase to self-condition the generated image using the estimated depth map. Experiments and comprehensive ablation studies demonstrate the effectiveness of our method in guiding the diffusion models towards the generation of geometrically plausible images.

翻译：最近,由于推广模型的推广模式,生成模型最近取得了显著进步,这些模型的成功往往可归因于它们使用指导技术,如分类师或免分类师指导,这些指导技术提供了在忠诚和多样性之间取舍的有效机制,然而,这些方法无法指导生成图像了解其几何配置,例如深度,这妨碍了将其应用于需要某种深度认识的领域。为了解决这一局限性,我们提议了一种新的传播模型指导方法,该方法使用从传播模型丰富的中间表示中得出的估计深度信息。我们首先利用内部的传播模型的表述提出贴标签效率的深度估计框架。随后,我们提议在取样阶段之前采用两种基于伪标签和深度分布的指导技术,以便利用估计深度图对生成图像进行自我调节。实验和全面化研究表明我们指导传播模型用于生成几何合理图像的方法的有效性。

0

相关内容

Guidance

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

因果图，Causal Graphs，52页ppt

因果图，Causal Graphs，52页ppt

专知会员服务

253+阅读 · 2020年4月19日

50+篇《神经架构搜索NAS》2020论文合集

专知会员服务

61+阅读 · 2020年3月19日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

vae 相关论文表示学习 1

vae 相关论文表示学习 1

CreateAMind

12+阅读 · 2018年9月6日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

【论文】图上的表示学习综述

【论文】图上的表示学习综述

机器学习研究会

15+阅读 · 2017年9月24日

调控马铃薯干旱胁迫响应相关转录因子的miRNA功能研究

国家自然科学基金

0+阅读 · 2014年12月31日

长链非编码RNA-TUSC7在胃癌中的抑癌作用及机制研究

国家自然科学基金

1+阅读 · 2014年12月31日

Septin7活化Ca2+/CaN/NFAT2信号途径在糖尿病肾病足细胞损伤中的作用及机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

一条全新的MPK-WRKY途径调控油菜防御核盘菌的分子机制解析

国家自然科学基金

0+阅读 · 2013年12月31日

Intraflagellar Transport运输纤毛蛋白的分子机理

国家自然科学基金

0+阅读 · 2012年12月31日

miR-223调控动脉粥样硬化斑块泡沫细胞形成和巨噬细胞凋亡的分子机制

国家自然科学基金

0+阅读 · 2012年12月31日

miR-370-LIN28A信号通路在肝癌发生发展中的作用

国家自然科学基金

0+阅读 · 2012年12月31日

表面等离激元增强宽光谱InGaN太阳能电池研究

国家自然科学基金

0+阅读 · 2012年12月31日

细胞色素P-450表氧化酶与5-脂氧酶调控动脉粥样硬化慢性炎症的作用与分子机制

国家自然科学基金

0+阅读 · 2011年12月31日

肾脏特异性miR-215调控Smad7在DN肾小球硬化发生中的作用研究

国家自然科学基金

0+阅读 · 2009年12月31日

Localizing Object-level Shape Variations with Text-to-Image Diffusion Models

Arxiv

0+阅读 · 2023年3月20日

Imagic: Text-Based Real Image Editing with Diffusion Models

Arxiv

0+阅读 · 2023年3月20日

Leaping Into Memories: Space-Time Deep Feature Synthesis

Arxiv

0+阅读 · 2023年3月20日

Diffusion-HPC: Generating Synthetic Images with Realistic Humans

Diffusion-HPC: Generating Synthetic Images with Realistic Humans

Arxiv

0+阅读 · 2023年3月16日

DiffIR: Efficient Diffusion Model for Image Restoration

Arxiv

0+阅读 · 2023年3月16日

Synthetic ECG Signal Generation using Probabilistic Diffusion Models

Arxiv

0+阅读 · 2023年3月16日

Deep Incubation: Training Large Models by Divide-and-Conquering

Arxiv

0+阅读 · 2023年3月16日

Diffusion Models: A Comprehensive Survey of Methods and Applications

Arxiv

67+阅读 · 2022年9月2日

Consensus Based Medical Image Segmentation Using Semi-Supervised Learning And Graph Cuts

Arxiv

11+阅读 · 2018年5月21日

Generative Adversarial Networks and Probabilistic Graph Models for Hyperspectral Image Classification

Arxiv

11+阅读 · 2018年2月10日

VIP会员

文章信息

相关主题

估计/估计量

相关VIP内容

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

因果图，Causal Graphs，52页ppt

因果图，Causal Graphs，52页ppt

专知会员服务

253+阅读 · 2020年4月19日

50+篇《神经架构搜索NAS》2020论文合集

专知会员服务

61+阅读 · 2020年3月19日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

人工智能赋能自主武器与人类控制第三部分：人类控制与系统操作员 | 35页

人工智能赋能自主武器与人类控制第一部分：人类控制与机器学习的设计和开发 | 46页

军事指挥控制系统：2025年5种用途

人工智能赋能自主武器与人类控制第二部分：人类控制与军事指挥官 | 38页

相关资讯

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

vae 相关论文表示学习 1

vae 相关论文表示学习 1

CreateAMind

12+阅读 · 2018年9月6日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

【论文】图上的表示学习综述

【论文】图上的表示学习综述

机器学习研究会

15+阅读 · 2017年9月24日

相关论文

Localizing Object-level Shape Variations with Text-to-Image Diffusion Models

Arxiv

0+阅读 · 2023年3月20日

Imagic: Text-Based Real Image Editing with Diffusion Models

Arxiv

0+阅读 · 2023年3月20日

Leaping Into Memories: Space-Time Deep Feature Synthesis

Arxiv

0+阅读 · 2023年3月20日

Diffusion-HPC: Generating Synthetic Images with Realistic Humans

Diffusion-HPC: Generating Synthetic Images with Realistic Humans

Arxiv

0+阅读 · 2023年3月16日

DiffIR: Efficient Diffusion Model for Image Restoration

Arxiv

0+阅读 · 2023年3月16日

Synthetic ECG Signal Generation using Probabilistic Diffusion Models

Arxiv

0+阅读 · 2023年3月16日

Deep Incubation: Training Large Models by Divide-and-Conquering

Arxiv

0+阅读 · 2023年3月16日

Diffusion Models: A Comprehensive Survey of Methods and Applications

Arxiv

67+阅读 · 2022年9月2日

Consensus Based Medical Image Segmentation Using Semi-Supervised Learning And Graph Cuts

Arxiv

11+阅读 · 2018年5月21日

Generative Adversarial Networks and Probabilistic Graph Models for Hyperspectral Image Classification

Arxiv

11+阅读 · 2018年2月10日

相关基金

调控马铃薯干旱胁迫响应相关转录因子的miRNA功能研究

国家自然科学基金

0+阅读 · 2014年12月31日

长链非编码RNA-TUSC7在胃癌中的抑癌作用及机制研究

国家自然科学基金

1+阅读 · 2014年12月31日

Septin7活化Ca2+/CaN/NFAT2信号途径在糖尿病肾病足细胞损伤中的作用及机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

一条全新的MPK-WRKY途径调控油菜防御核盘菌的分子机制解析

国家自然科学基金

0+阅读 · 2013年12月31日

Intraflagellar Transport运输纤毛蛋白的分子机理

国家自然科学基金

0+阅读 · 2012年12月31日

miR-223调控动脉粥样硬化斑块泡沫细胞形成和巨噬细胞凋亡的分子机制

国家自然科学基金

0+阅读 · 2012年12月31日

miR-370-LIN28A信号通路在肝癌发生发展中的作用

国家自然科学基金

0+阅读 · 2012年12月31日

表面等离激元增强宽光谱InGaN太阳能电池研究

国家自然科学基金

0+阅读 · 2012年12月31日

细胞色素P-450表氧化酶与5-脂氧酶调控动脉粥样硬化慢性炎症的作用与分子机制

国家自然科学基金

0+阅读 · 2011年12月31日

肾脏特异性miR-215调控Smad7在DN肾小球硬化发生中的作用研究

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员