Diffusion probabilistic models have been successful in generating high-quality and diverse images. However, traditional models, whose input and output are high-resolution images, suffer from excessive memory requirements, making them less practical for edge devices. Previous work on generative adversarial networks proposed a patch-based method that uses positional encoding and global content information. Nevertheless, designing a patch-based approach for diffusion probabilistic models is non-trivial. In this paper, we present a diffusion probabilistic model that generates images on a patch-by-patch basis. We propose two conditioning methods for patch-based generation. First, we propose position-wise conditioning using a one-hot representation to ensure that patches are generated in their proper positions. Second, we propose Global Content Conditioning (GCC) to ensure that patches have coherent content when concatenated together. We evaluate our model qualitatively and quantitatively on the CelebA and LSUN bedroom datasets and demonstrate a moderate trade-off between maximum memory consumption and generated image quality. Specifically, when an entire image is divided into 2 × 2 patches, our proposed approach reduces maximum memory consumption by half while maintaining comparable image quality.
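To make the two conditioning signals concrete, the following is a minimal sketch, not the authors' implementation, assuming a 2 × 2 patch grid, 64 × 64 patches, and a channels-first tensor layout. The helper names `make_position_channels` and `make_global_content` are hypothetical; the idea is that a patch-based denoiser would receive these as extra input channels concatenated to the noisy patch.

```python
import torch

def make_position_channels(row, col, grid=2, patch_size=64):
    """One-hot position conditioning (assumed form): one channel per grid
    cell, set to 1 over the whole patch for the cell this patch occupies."""
    pos = torch.zeros(grid * grid, patch_size, patch_size)
    pos[row * grid + col] = 1.0
    return pos

def make_global_content(image, patch_size=64):
    """Global Content Conditioning (GCC, assumed form): a low-resolution
    view of the whole image, resized to the patch resolution so every
    patch is conditioned on the same coarse global content."""
    return torch.nn.functional.interpolate(
        image.unsqueeze(0), size=(patch_size, patch_size),
        mode="bilinear", align_corners=False).squeeze(0)

# Usage: build the conditioned input for the top-right patch of a
# 128 x 128 RGB image (values here are random placeholders).
image = torch.rand(3, 128, 128)       # full image seen by GCC
noisy_patch = torch.rand(3, 64, 64)   # noisy patch fed to the denoiser
cond = torch.cat([noisy_patch,
                  make_position_channels(row=0, col=1),
                  make_global_content(image)], dim=0)
print(cond.shape)  # torch.Size([10, 64, 64]) = 3 + 4 + 3 channels
```

Because the denoiser only ever processes one 64 × 64 patch (plus these thin conditioning channels) instead of the full 128 × 128 image, peak activation memory scales with the patch size rather than the image size, which is the source of the memory savings the abstract reports.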