锥体:自定制一代传播模型中的概念中的新元</s> (Cones: Concept Neurons in Diffusion Models for Customized Generation) - 专知论文

会员服务 ·

0

神经元 · 簇 · MoDELS · Networking · 可辨认的 ·

2023 年 3 月 9 日

Cones: Concept Neurons in Diffusion Models for Customized Generation

翻译：锥体:自定制一代传播模型中的概念中的新元

Zhiheng Liu,Ruili Feng,Kai Zhu,Yifei Zhang,Kecheng Zheng,Yu Liu,Deli Zhao,Jingren Zhou,Yang Cao

Human brains respond to semantic features of presented stimuli with different neurons. It is then curious whether modern deep neural networks admit a similar behavior pattern. Specifically, this paper finds a small cluster of neurons in a diffusion model corresponding to a particular subject. We call those neurons the concept neurons. They can be identified by statistics of network gradients to a stimulation connected with the given subject. The concept neurons demonstrate magnetic properties in interpreting and manipulating generation results. Shutting them can directly yield the related subject contextualized in different scenes. Concatenating multiple clusters of concept neurons can vividly generate all related concepts in a single image. A few steps of further fine-tuning can enhance the multi-concept capability, which may be the first to manage to generate up to four different subjects in a single image. For large-scale applications, the concept neurons are environmentally friendly as we only need to store a sparse cluster of int index instead of dense float32 values of the parameters, which reduces storage consumption by 90\% compared with previous subject-driven generation methods. Extensive qualitative and quantitative studies on diverse scenarios show the superiority of our method in interpreting and manipulating diffusion models.

翻译：人类大脑对不同神经神经元的演示刺激的语义特征作出反应。然后令人好奇的是,现代深层神经网络是否接受类似的行为模式。具体地说, 本文在与特定主题相对应的传播模型中发现一小组神经元。我们将这些神经元称为概念神经元。这些神经元可以通过网络梯度的统计与与与特定主题相关的刺激来识别。概念神经元在解释和操控生成结果时表现出磁性特性。关闭它们可以直接产生不同场景的相关主题背景。配置多组概念神经元能够在一个图像中生动地生成所有相关的概念。进一步微调的几步步骤可以增强多种概念能力, 而这些能力可能是第一个在单一图像中生成最多四个不同主题的神经元。对于大型应用来说, 概念神经元具有环境友好性, 因为我们只需要储存一个稀疏的内在指数群, 而不是密集的浮点32 值, 与先前的受主题驱动的生成方法相比, 将存储消耗量减少 90° 。对不同情景进行广泛的定性和定量研究, 显示我们在解释和操控模型中的方法的优越性。</s>

0

相关内容

神经元

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

vae 相关论文表示学习 1

vae 相关论文表示学习 1

CreateAMind

12+阅读 · 2018年9月6日

ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

全球人工智能

20+阅读 · 2017年12月17日

【推荐】ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

【推荐】ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

机器学习研究会

20+阅读 · 2017年12月17日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

高效GaN基绿光LED研究

国家自然科学基金

0+阅读 · 2013年12月31日

半导体衬底上FeSe薄膜的外延生长及界面超导

国家自然科学基金

0+阅读 · 2013年12月31日

超低能Si团簇负离子束沉积制备硅烯(silicene)

国家自然科学基金

0+阅读 · 2012年12月31日

各向异性银纳米粒子自组装的纤维结构生色及其光谱特性与调控

国家自然科学基金

0+阅读 · 2012年12月31日

快速凝固高熵合金的微结构控制及其形成机理

国家自然科学基金

0+阅读 · 2012年12月31日

TiO2表面光催化动力学的实验研究

国家自然科学基金

0+阅读 · 2012年12月31日

高能量、窄线宽两波耦合双腔ZGP-OPO中红外参量研究

国家自然科学基金

0+阅读 · 2012年12月31日

液相法制备钒酸铋光催化剂及其光催化活性增强机理的研究

国家自然科学基金

0+阅读 · 2011年12月31日

银河系内高温气体的分布和起源

国家自然科学基金

0+阅读 · 2011年12月31日

UGT基因簇进化及调控研究

国家自然科学基金

0+阅读 · 2009年12月31日

AI-Assisted Ethics? Considerations of AI Simulation for the Ethical Assessment and Design of Assistive Technologies

Arxiv

0+阅读 · 2023年4月30日

Blended Latent Diffusion

Arxiv

0+阅读 · 2023年4月30日

Causal effects of intervening variables in settings with unmeasured confounding

Arxiv

0+阅读 · 2023年4月29日

Towards Automated Circuit Discovery for Mechanistic Interpretability

Arxiv

0+阅读 · 2023年4月28日

Evaluating the Stability of Semantic Concept Representations in CNNs for Robust Explainability

Arxiv

0+阅读 · 2023年4月28日

Generative Diffusion Models on Graphs: Methods and Applications

Arxiv

1+阅读 · 2023年4月28日

MUDiff: Unified Diffusion for Complete Molecule Generation

Arxiv

0+阅读 · 2023年4月28日

A Systematic Survey on Deep Generative Models for Graph Generation

Arxiv

18+阅读 · 2022年10月4日

Diffusion Models in Vision: A Survey

Arxiv

30+阅读 · 2022年9月10日

Curriculum Learning: A Survey

Arxiv

24+阅读 · 2021年1月25日

VIP会员

文章信息

相关主题

相关VIP内容

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

面向性能、成本效益、云边隐私与可信性的大小语言模型协作综述

乌克兰太空研究（2022-2024年） | 176页

【CMU博士论文】大型语言模型的隐性特性

国防领域人工智能走向何方？

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

vae 相关论文表示学习 1

vae 相关论文表示学习 1

CreateAMind

12+阅读 · 2018年9月6日

ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

全球人工智能

20+阅读 · 2017年12月17日

【推荐】ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

【推荐】ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

机器学习研究会

20+阅读 · 2017年12月17日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

相关论文

AI-Assisted Ethics? Considerations of AI Simulation for the Ethical Assessment and Design of Assistive Technologies

Arxiv

0+阅读 · 2023年4月30日

Blended Latent Diffusion

Arxiv

0+阅读 · 2023年4月30日

Causal effects of intervening variables in settings with unmeasured confounding

Arxiv

0+阅读 · 2023年4月29日

Towards Automated Circuit Discovery for Mechanistic Interpretability

Arxiv

0+阅读 · 2023年4月28日

Evaluating the Stability of Semantic Concept Representations in CNNs for Robust Explainability

Arxiv

0+阅读 · 2023年4月28日

Generative Diffusion Models on Graphs: Methods and Applications

Arxiv

1+阅读 · 2023年4月28日

MUDiff: Unified Diffusion for Complete Molecule Generation

Arxiv

0+阅读 · 2023年4月28日

A Systematic Survey on Deep Generative Models for Graph Generation

Arxiv

18+阅读 · 2022年10月4日

Diffusion Models in Vision: A Survey

Arxiv

30+阅读 · 2022年9月10日

Curriculum Learning: A Survey

Arxiv

24+阅读 · 2021年1月25日

相关基金

高效GaN基绿光LED研究

国家自然科学基金

0+阅读 · 2013年12月31日

半导体衬底上FeSe薄膜的外延生长及界面超导

国家自然科学基金

0+阅读 · 2013年12月31日

超低能Si团簇负离子束沉积制备硅烯(silicene)

国家自然科学基金

0+阅读 · 2012年12月31日

各向异性银纳米粒子自组装的纤维结构生色及其光谱特性与调控

国家自然科学基金

0+阅读 · 2012年12月31日

快速凝固高熵合金的微结构控制及其形成机理

国家自然科学基金

0+阅读 · 2012年12月31日

TiO2表面光催化动力学的实验研究

国家自然科学基金

0+阅读 · 2012年12月31日

高能量、窄线宽两波耦合双腔ZGP-OPO中红外参量研究

国家自然科学基金

0+阅读 · 2012年12月31日

液相法制备钒酸铋光催化剂及其光催化活性增强机理的研究

国家自然科学基金

0+阅读 · 2011年12月31日

银河系内高温气体的分布和起源

国家自然科学基金

0+阅读 · 2011年12月31日

UGT基因簇进化及调控研究

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员