GANs:分析和改进 (Why Spectral Normalization Stabilizes GANs: Analysis and Improvements) - 专知论文

会员服务 ·

0

规范化的 · GANs · 控制器 · Performer · Better ·

2021 年 4 月 8 日

Why Spectral Normalization Stabilizes GANs: Analysis and Improvements

翻译：GANs:分析和改进

Zinan Lin,Vyas Sekar,Giulia Fanti

from arxiv, 54 pages, 74 figures

Spectral normalization (SN) is a widely-used technique for improving the stability and sample quality of Generative Adversarial Networks (GANs). However, there is currently limited understanding of why SN is effective. In this work, we show that SN controls two important failure modes of GAN training: exploding and vanishing gradients. Our proofs illustrate a (perhaps unintentional) connection with the successful LeCun initialization. This connection helps to explain why the most popular implementation of SN for GANs requires no hyper-parameter tuning, whereas stricter implementations of SN have poor empirical performance out-of-the-box. Unlike LeCun initialization which only controls gradient vanishing at the beginning of training, SN preserves this property throughout training. Building on this theoretical understanding, we propose a new spectral normalization technique: Bidirectional Scaled Spectral Normalization (BSSN), which incorporates insights from later improvements to LeCun initialization: Xavier initialization and Kaiming initialization. Theoretically, we show that BSSN gives better gradient control than SN. Empirically, we demonstrate that it outperforms SN in sample quality and training stability on several benchmark datasets.

翻译：光谱正常化(SN)是用来改善基因反转网络(GANs)的稳定性和样本质量的一种广泛应用技术。然而,目前对SN之所以有效的原因了解有限。在这项工作中,我们显示SN控制了GAN训练的两种重要的失败模式:爆炸和消失梯度。我们的证明表明与成功的 LeCun 初始化有(可能是无意的)联系。这个联系有助于解释为什么最受欢迎的对GANs的SN实施不需要超参数调整,而更严格地执行SN的实验性能则出局。与只控制在培训开始时消失的梯度的LECun初始化不同,SNN在整个培训中保存了这一属性。我们根据这种理论理解提出新的光谱正常化技术:双向缩放光谱正常化(BSSN),它包含了后来对LEC初始化的改进:Xavier 初始化和Kaiming初始化的洞察力。理论上,我们表明,BSNSN比SN的梯度控制要好一些样本质量标准。

0

相关内容

规范化的

【经典书】计算理论导论，482页pdf

【经典书】计算理论导论，482页pdf

专知会员服务

77+阅读 · 2021年4月10日

最新《生成式对抗网络数学导论》，30页pdf

最新《生成式对抗网络数学导论》，30页pdf

专知会员服务

76+阅读 · 2020年9月3日

神经常微分方程教程，50页ppt，A brief tutorial on Neural ODEs

神经常微分方程教程，50页ppt，A brief tutorial on Neural ODEs

专知会员服务

70+阅读 · 2020年8月2日

GANs最新进展，30页ppt，GANs: the story so far

GANs最新进展，30页ppt，GANs: the story so far

专知会员服务

42+阅读 · 2020年8月2日

【ICML2020】序数非负矩阵分解推荐，On the Number of Linear Regions of Convolutional Neural Networks

【ICML2020】序数非负矩阵分解推荐，On the Number of Linear Regions of Convolutional Neural Networks

专知会员服务

15+阅读 · 2020年6月4日

(普林斯顿讲义)：高维概率论，326页pdf《Probability in High Dimension》

(普林斯顿讲义)：高维概率论，326页pdf《Probability in High Dimension》

专知会员服务

119+阅读 · 2020年5月30日

【硬核书】信息论，528页pdf，Information Theory and Coding by Example

【硬核书】信息论，528页pdf，Information Theory and Coding by Example

专知会员服务

142+阅读 · 2020年4月20日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

91+阅读 · 2020年3月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

44+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

30+阅读 · 2019年10月17日

生成对抗网络GANs学习路线

生成对抗网络GANs学习路线

专知

36+阅读 · 2019年6月10日

详解GAN的谱归一化（Spectral Normalization）

详解GAN的谱归一化（Spectral Normalization）

PaperWeekly

11+阅读 · 2019年2月13日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

17+阅读 · 2019年1月7日

vae 相关论文表示学习 1

vae 相关论文表示学习 1

CreateAMind

12+阅读 · 2018年9月6日

Jointly Improving Summarization and Sentiment Classification

Jointly Improving Summarization and Sentiment Classification

黑龙江大学自然语言处理实验室

3+阅读 · 2018年6月12日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

条件GAN重大改进！cGANs with Projection Discriminator

条件GAN重大改进！cGANs with Projection Discriminator

CreateAMind

8+阅读 · 2018年2月7日

gan生成图像at 1024² 的代码论文

gan生成图像at 1024² 的代码论文

CreateAMind

4+阅读 · 2017年10月31日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

GAN猫的脸

机械鸡

11+阅读 · 2017年7月8日

Analysis and Applications of Class-wise Robustness in Adversarial Training

Arxiv

0+阅读 · 2021年6月2日

MelGAN: Generative Adversarial Networks for Conditional Waveform Synthesis

MelGAN: Generative Adversarial Networks for Conditional Waveform Synthesis

Arxiv

7+阅读 · 2019年10月8日

A Sensitivity Analysis of Attention-Gated Convolutional Neural Networks for Sentence Classification

A Sensitivity Analysis of Attention-Gated Convolutional Neural Networks for Sentence Classification

Arxiv

4+阅读 · 2019年8月25日

Generative Adversarial Networks and Conditional Random Fields for Hyperspectral Image Classification

Arxiv

3+阅读 · 2019年5月12日

Generative Adversarial Autoencoder Networks

Arxiv

10+阅读 · 2018年3月23日

Generative Adversarial Networks and Probabilistic Graph Models for Hyperspectral Image Classification

Arxiv

11+阅读 · 2018年2月10日

Activation Maximization Generative Adversarial Nets

Arxiv

5+阅读 · 2018年1月30日

Improved Training of Generative Adversarial Networks Using Representative Features

Arxiv

7+阅读 · 2018年1月28日

Conditional Random Field and Deep Feature Learning for Hyperspectral Image Segmentation

Arxiv

11+阅读 · 2017年12月27日

Progressive Growing of GANs for Improved Quality, Stability, and Variation

Arxiv

3+阅读 · 2017年11月3日

VIP会员

文章信息

相关主题

相关VIP内容

【经典书】计算理论导论，482页pdf

【经典书】计算理论导论，482页pdf

专知会员服务

77+阅读 · 2021年4月10日

最新《生成式对抗网络数学导论》，30页pdf

最新《生成式对抗网络数学导论》，30页pdf

专知会员服务

76+阅读 · 2020年9月3日

神经常微分方程教程，50页ppt，A brief tutorial on Neural ODEs

神经常微分方程教程，50页ppt，A brief tutorial on Neural ODEs

专知会员服务

70+阅读 · 2020年8月2日

GANs最新进展，30页ppt，GANs: the story so far

GANs最新进展，30页ppt，GANs: the story so far

专知会员服务

42+阅读 · 2020年8月2日

【ICML2020】序数非负矩阵分解推荐，On the Number of Linear Regions of Convolutional Neural Networks

【ICML2020】序数非负矩阵分解推荐，On the Number of Linear Regions of Convolutional Neural Networks

专知会员服务

15+阅读 · 2020年6月4日

(普林斯顿讲义)：高维概率论，326页pdf《Probability in High Dimension》

(普林斯顿讲义)：高维概率论，326页pdf《Probability in High Dimension》

专知会员服务

119+阅读 · 2020年5月30日

【硬核书】信息论，528页pdf，Information Theory and Coding by Example

【硬核书】信息论，528页pdf，Information Theory and Coding by Example

专知会员服务

142+阅读 · 2020年4月20日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

91+阅读 · 2020年3月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

44+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

30+阅读 · 2019年10月17日

热门VIP内容

相关资讯

生成对抗网络GANs学习路线

生成对抗网络GANs学习路线

专知

36+阅读 · 2019年6月10日

详解GAN的谱归一化（Spectral Normalization）

详解GAN的谱归一化（Spectral Normalization）

PaperWeekly

11+阅读 · 2019年2月13日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

17+阅读 · 2019年1月7日

vae 相关论文表示学习 1

vae 相关论文表示学习 1

CreateAMind

12+阅读 · 2018年9月6日

Jointly Improving Summarization and Sentiment Classification

Jointly Improving Summarization and Sentiment Classification

黑龙江大学自然语言处理实验室

3+阅读 · 2018年6月12日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

条件GAN重大改进！cGANs with Projection Discriminator

条件GAN重大改进！cGANs with Projection Discriminator

CreateAMind

8+阅读 · 2018年2月7日

gan生成图像at 1024² 的代码论文

gan生成图像at 1024² 的代码论文

CreateAMind

4+阅读 · 2017年10月31日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

GAN猫的脸

机械鸡

11+阅读 · 2017年7月8日

相关论文

Analysis and Applications of Class-wise Robustness in Adversarial Training

Arxiv

0+阅读 · 2021年6月2日

MelGAN: Generative Adversarial Networks for Conditional Waveform Synthesis

MelGAN: Generative Adversarial Networks for Conditional Waveform Synthesis

Arxiv

7+阅读 · 2019年10月8日

A Sensitivity Analysis of Attention-Gated Convolutional Neural Networks for Sentence Classification

A Sensitivity Analysis of Attention-Gated Convolutional Neural Networks for Sentence Classification

Arxiv

4+阅读 · 2019年8月25日

Generative Adversarial Networks and Conditional Random Fields for Hyperspectral Image Classification

Arxiv

3+阅读 · 2019年5月12日

Generative Adversarial Autoencoder Networks

Arxiv

10+阅读 · 2018年3月23日

Generative Adversarial Networks and Probabilistic Graph Models for Hyperspectral Image Classification

Arxiv

11+阅读 · 2018年2月10日

Activation Maximization Generative Adversarial Nets

Arxiv

5+阅读 · 2018年1月30日

Improved Training of Generative Adversarial Networks Using Representative Features

Arxiv

7+阅读 · 2018年1月28日

Conditional Random Field and Deep Feature Learning for Hyperspectral Image Segmentation

Arxiv

11+阅读 · 2017年12月27日

Progressive Growing of GANs for Improved Quality, Stability, and Variation

Arxiv

3+阅读 · 2017年11月3日

微信扫码咨询专知VIP会员