Class-Balancing Diffusion Models (Class-Balancing Diffusion Models) - 专知论文

会员服务 ·

0

多样性 · 不平衡 · 数据分布 · 类平衡 · 类不平衡 ·

2023 年 4 月 30 日

Class-Balancing Diffusion Models

翻译：Class-Balancing Diffusion Models

Yiming Qin,Huangjie Zheng,Jiangchao Yao,Mingyuan Zhou,Ya Zhang

from arxiv, Accepted by CVPR2023

Diffusion-based models have shown the merits of generating high-quality visual data while preserving better diversity in recent studies. However, such observation is only justified with curated data distribution, where the data samples are nicely pre-processed to be uniformly distributed in terms of their labels. In practice, a long-tailed data distribution appears more common and how diffusion models perform on such class-imbalanced data remains unknown. In this work, we first investigate this problem and observe significant degradation in both diversity and fidelity when the diffusion model is trained on datasets with class-imbalanced distributions. Especially in tail classes, the generations largely lose diversity and we observe severe mode-collapse issues. To tackle this problem, we set from the hypothesis that the data distribution is not class-balanced, and propose Class-Balancing Diffusion Models (CBDM) that are trained with a distribution adjustment regularizer as a solution. Experiments show that images generated by CBDM exhibit higher diversity and quality in both quantitative and qualitative ways. Our method benchmarked the generation results on CIFAR100/CIFAR100LT dataset and shows outstanding performance on the downstream recognition task.

翻译：类平衡扩散模型摘要：最近的研究表明，基于扩散的模型在生成高质量的视觉数据的同时，更好地保留了数据的多样性。然而，这种观察结果仅在数据分布经过精心预处理并在标签上均匀分布的情况下成立。实际上，长尾数据分布更为常见，扩散模型在这种类不平衡的数据上的性能仍然未知。在这项工作中，我们首先研究了这个问题，并观察到当扩散模型在具有类不平衡分布的数据集上训练时会出现明显的多样性和保真度下降。特别是在尾部类别中，产生的图像大幅度丧失了多样性，我们观察到了严重的模式崩溃问题。为了解决这个问题，我们从数据分布不平衡的假设出发，提出了带有分布调整正则化器的类平衡扩散模型（CBDM）作为解决方案。实验表明，CBDM 生成的图像在定量和定性方面都具有更高的多样性和质量。我们的方法在 CIFAR100/CIFAR100LT 数据集上进行了生成结果的基准测试，并在下游识别任务上展现出了卓越的性能。

0

相关内容

多样性

【2022新书】高效深度学习，Efficient Deep Learning Book

【2022新书】高效深度学习，Efficient Deep Learning Book

专知会员服务

125+阅读 · 2022年4月21日

【MIT-ICLR2022】在机器学习模型中注入公平性, Injecting fairness into machine-learning models

【MIT-ICLR2022】在机器学习模型中注入公平性, Injecting fairness into machine-learning models

专知会员服务

22+阅读 · 2022年3月7日

近期必读的5篇顶会ICCV 2021【语义分割】相关论文和代码

专知会员服务

43+阅读 · 2021年8月20日

近期必读的5篇顶会CVPR 2021【图像分类】相关论文和代码

专知会员服务

80+阅读 · 2021年4月7日

【KDD2020】图神经网络生成式预训练，GPT-GNN: Generative Pre-Training of Graph Neural Networks

【KDD2020】图神经网络生成式预训练，GPT-GNN: Generative Pre-Training of Graph Neural Networks

专知会员服务

99+阅读 · 2020年7月3日

零样本文本分类，Zero-Shot Learning for Text Classification

零样本文本分类，Zero-Shot Learning for Text Classification

专知会员服务

97+阅读 · 2020年5月31日

【清华大学】诊断和增强VAE模型，Diagnosing and Enhancing VAE Models

【清华大学】诊断和增强VAE模型，Diagnosing and Enhancing VAE Models

专知会员服务

37+阅读 · 2020年2月27日

【微软研究院】IMAGEBERT: CROSS-MODAL PRE-TRAINING WITH LARGE-SCALE WEAK-SUPERVISED IMAGE-TEXT DATA

【微软研究院】IMAGEBERT: CROSS-MODAL PRE-TRAINING WITH LARGE-SCALE WEAK-SUPERVISED IMAGE-TEXT DATA

专知会员服务

43+阅读 · 2020年1月28日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

Soft Diffusion：谷歌新框架从通用扩散过程中正确调度、学习和采样

Soft Diffusion：谷歌新框架从通用扩散过程中正确调度、学习和采样

机器之心

2+阅读 · 2022年10月12日

Multi-Task Learning的几篇综述文章

Multi-Task Learning的几篇综述文章

深度学习自然语言处理

15+阅读 · 2020年6月15日

深度卷积神经网络中的降采样

深度卷积神经网络中的降采样

极市平台

12+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

【论文推荐】最新5篇行人再识别（ReID）相关论文—迁移学习、特征集成、重排序、多通道金字塔、深层生成模型

【论文推荐】最新5篇行人再识别（ReID）相关论文—迁移学习、特征集成、重排序、多通道金字塔、深层生成模型

专知

12+阅读 · 2018年3月24日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

MEKK1-MKK4-JNK1信号模块与HO-1的结合位点在神经炎症中的作用和机制

国家自然科学基金

0+阅读 · 2015年12月31日

Schr？dinger-Poisson方程守恒DDG方法研究

国家自然科学基金

2+阅读 · 2015年12月31日

带限制条件的凯莱图顶点划分研究

国家自然科学基金

0+阅读 · 2014年12月31日

多约束条件下的目标超分辨检测方法研究

国家自然科学基金

0+阅读 · 2013年12月31日

Ghrelin对老年性骨骼肌肉减少症的作用及分子机制的研究

国家自然科学基金

0+阅读 · 2013年12月31日

多时相InSAR相干性估计研究

国家自然科学基金

0+阅读 · 2013年12月31日

新型抗生素Bagremycins生物合成基因簇的鉴定与解析

国家自然科学基金

0+阅读 · 2012年12月31日

CIECAM02拓展研究

国家自然科学基金

0+阅读 · 2011年12月31日

一种适用于高维问题的Co-kriging代理模型新方法研究

国家自然科学基金

0+阅读 · 2009年12月31日

粗集的线性结构及其在粒计算中的拓展研究

国家自然科学基金

0+阅读 · 2009年12月31日

Diffusion Models for Zero-Shot Open-Vocabulary Segmentation

Arxiv

0+阅读 · 2023年6月15日

Training Diffusion Classifiers with Denoising Assistance

Arxiv

0+阅读 · 2023年6月15日

Towards Mode Balancing of Generative Models via Diversity Weights

Arxiv

0+阅读 · 2023年6月15日

Toward Grounded Social Reasoning

Arxiv

0+阅读 · 2023年6月14日

Are minimizers of the Onsager-Machlup functional strong posterior modes?

Arxiv

0+阅读 · 2023年6月14日

Towards Balanced Active Learning for Multimodal Classification

Arxiv

0+阅读 · 2023年6月14日

User-defined Event Sampling and Uncertainty Quantification in Diffusion Models for Physical Dynamical Systems

Arxiv

0+阅读 · 2023年6月13日

Diffusion Models in Vision: A Survey

Arxiv

30+阅读 · 2022年9月10日

Diffusion Models: A Comprehensive Survey of Methods and Applications

Arxiv

67+阅读 · 2022年9月2日

Class-Balanced Loss Based on Effective Number of Samples

Arxiv

12+阅读 · 2019年1月16日

VIP会员

文章信息

相关主题

相关VIP内容

【2022新书】高效深度学习，Efficient Deep Learning Book

【2022新书】高效深度学习，Efficient Deep Learning Book

专知会员服务

125+阅读 · 2022年4月21日

【MIT-ICLR2022】在机器学习模型中注入公平性, Injecting fairness into machine-learning models

【MIT-ICLR2022】在机器学习模型中注入公平性, Injecting fairness into machine-learning models

专知会员服务

22+阅读 · 2022年3月7日

近期必读的5篇顶会ICCV 2021【语义分割】相关论文和代码

专知会员服务

43+阅读 · 2021年8月20日

近期必读的5篇顶会CVPR 2021【图像分类】相关论文和代码

专知会员服务

80+阅读 · 2021年4月7日

【KDD2020】图神经网络生成式预训练，GPT-GNN: Generative Pre-Training of Graph Neural Networks

【KDD2020】图神经网络生成式预训练，GPT-GNN: Generative Pre-Training of Graph Neural Networks

专知会员服务

99+阅读 · 2020年7月3日

零样本文本分类，Zero-Shot Learning for Text Classification

零样本文本分类，Zero-Shot Learning for Text Classification

专知会员服务

97+阅读 · 2020年5月31日

【清华大学】诊断和增强VAE模型，Diagnosing and Enhancing VAE Models

【清华大学】诊断和增强VAE模型，Diagnosing and Enhancing VAE Models

专知会员服务

37+阅读 · 2020年2月27日

【微软研究院】IMAGEBERT: CROSS-MODAL PRE-TRAINING WITH LARGE-SCALE WEAK-SUPERVISED IMAGE-TEXT DATA

【微软研究院】IMAGEBERT: CROSS-MODAL PRE-TRAINING WITH LARGE-SCALE WEAK-SUPERVISED IMAGE-TEXT DATA

专知会员服务

43+阅读 · 2020年1月28日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

人机协同作战规划：来自美海军陆战队的大语言模型（LLM）使用教训

对北约军事总部战略规划制定与实施的研究 | 140页

美联参会指南-联合规划与执行概述及政策框架 | 32页

俄罗斯军事规划差异性凸显其思维的重要性 | 2025最新文献

相关资讯

Soft Diffusion：谷歌新框架从通用扩散过程中正确调度、学习和采样

Soft Diffusion：谷歌新框架从通用扩散过程中正确调度、学习和采样

机器之心

2+阅读 · 2022年10月12日

Multi-Task Learning的几篇综述文章

Multi-Task Learning的几篇综述文章

深度学习自然语言处理

15+阅读 · 2020年6月15日

深度卷积神经网络中的降采样

深度卷积神经网络中的降采样

极市平台

12+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

【论文推荐】最新5篇行人再识别（ReID）相关论文—迁移学习、特征集成、重排序、多通道金字塔、深层生成模型

【论文推荐】最新5篇行人再识别（ReID）相关论文—迁移学习、特征集成、重排序、多通道金字塔、深层生成模型

专知

12+阅读 · 2018年3月24日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

相关论文

Diffusion Models for Zero-Shot Open-Vocabulary Segmentation

Arxiv

0+阅读 · 2023年6月15日

Training Diffusion Classifiers with Denoising Assistance

Arxiv

0+阅读 · 2023年6月15日

Towards Mode Balancing of Generative Models via Diversity Weights

Arxiv

0+阅读 · 2023年6月15日

Toward Grounded Social Reasoning

Arxiv

0+阅读 · 2023年6月14日

Are minimizers of the Onsager-Machlup functional strong posterior modes?

Arxiv

0+阅读 · 2023年6月14日

Towards Balanced Active Learning for Multimodal Classification

Arxiv

0+阅读 · 2023年6月14日

User-defined Event Sampling and Uncertainty Quantification in Diffusion Models for Physical Dynamical Systems

Arxiv

0+阅读 · 2023年6月13日

Diffusion Models in Vision: A Survey

Arxiv

30+阅读 · 2022年9月10日

Diffusion Models: A Comprehensive Survey of Methods and Applications

Arxiv

67+阅读 · 2022年9月2日

Class-Balanced Loss Based on Effective Number of Samples

Arxiv

12+阅读 · 2019年1月16日

相关基金

MEKK1-MKK4-JNK1信号模块与HO-1的结合位点在神经炎症中的作用和机制

国家自然科学基金

0+阅读 · 2015年12月31日

Schr？dinger-Poisson方程守恒DDG方法研究

国家自然科学基金

2+阅读 · 2015年12月31日

带限制条件的凯莱图顶点划分研究

国家自然科学基金

0+阅读 · 2014年12月31日

多约束条件下的目标超分辨检测方法研究

国家自然科学基金

0+阅读 · 2013年12月31日

Ghrelin对老年性骨骼肌肉减少症的作用及分子机制的研究

国家自然科学基金

0+阅读 · 2013年12月31日

多时相InSAR相干性估计研究

国家自然科学基金

0+阅读 · 2013年12月31日

新型抗生素Bagremycins生物合成基因簇的鉴定与解析

国家自然科学基金

0+阅读 · 2012年12月31日

CIECAM02拓展研究

国家自然科学基金

0+阅读 · 2011年12月31日

一种适用于高维问题的Co-kriging代理模型新方法研究

国家自然科学基金

0+阅读 · 2009年12月31日

粗集的线性结构及其在粒计算中的拓展研究

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员