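Keras Implementations of 17 GAN Variants | Trending Open-Source Code on GitHub

March 1, 2018 · QbitAI (量子位) · Following frontier technology

Compiled and edited by Xia Yi (夏乙)
Produced by QbitAI (量子位) | WeChat official account: QbitAI

△ Source: Kaggle blog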

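Since its debut in 2014, the generative adversarial network (GAN) has drawn sustained attention, and more than 200 named variants have appeared.

The creative range of this "forgery craft" has grown from the earliest handwritten digits and blurry images a few hundred pixels in size to wallpaper-grade high-resolution photos, celebrity faces, and even artistic paintings.

Itching to get started?

Learning by getting hands-on and exploring model code is of course an excellent approach, and it is even better with Keras, a framework that is simple and easy to pick up.

GitHub user eriklindernoren has published Keras implementations of 17 GANs, and the collection earned an enthusiastic recommendation on Twitter from François Chollet, the creator of Keras.

The repository: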

https://github.com/eriklindernoren/Keras-GAN

AC-GAN

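A GAN with an auxiliary classifier; the full name is Auxiliary Classifier GAN.

In this variant, every image the generator produces carries a class label, and the discriminator outputs two probability distributions, one over the source (real or generated) and one over the class labels.

The model described in the paper can generate 128×128 images for all 1000 ImageNet classes.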
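To make the two-output discriminator concrete, here is a minimal NumPy sketch. It is not the repository's code: the feature vector, the weight matrices, and all sizes are hypothetical stand-ins, and a real model would compute the features with a trained network.

```python
import numpy as np

def softmax(logits):
    # Subtract the row-wise max for numerical stability.
    shifted = logits - logits.max(axis=1, keepdims=True)
    exp = np.exp(shifted)
    return exp / exp.sum(axis=1, keepdims=True)

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def acgan_discriminator_heads(features, w_source, w_class):
    """Map shared features to (P(real), P(class)) for each image."""
    p_real = sigmoid(features @ w_source)   # shape (batch, 1)
    p_class = softmax(features @ w_class)   # shape (batch, num_classes)
    return p_real, p_class

rng = np.random.default_rng(0)
batch, feat_dim, num_classes = 4, 16, 10        # hypothetical sizes
features = rng.normal(size=(batch, feat_dim))   # stand-in for learned features
w_source = rng.normal(size=(feat_dim, 1))       # hypothetical "real vs. fake" head
w_class = rng.normal(size=(feat_dim, num_classes))  # hypothetical class head
p_real, p_class = acgan_discriminator_heads(features, w_source, w_class)
```

Both heads read the same features, so the class signal and the real/fake signal are learned together.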

Code:

https://github.com/eriklindernoren/Keras-GAN/blob/master/acgan/acgan.py

Paper:

Conditional Image Synthesis With Auxiliary Classifier GANs
Augustus Odena, Christopher Olah, Jonathon Shlens
https://arxiv.org/abs/1610.09585

Adversarial Autoencoder

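This model, abbreviated AAE, is a probabilistic autoencoder that uses a GAN to perform variational inference by matching the aggregated posterior of the autoencoder's hidden code vector with an arbitrary prior distribution. It can be used for semi-supervised classification, disentangling image style from content, unsupervised clustering, dimensionality reduction, and data visualization.

In the paper, the researchers show samples generated by models trained on MNIST and the Toronto Face Dataset (TFD).

Code:

https://github.com/eriklindernoren/Keras-GAN/blob/master/aae/adversarial_autoencoder.py

Paper: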

Adversarial Autoencoders
Alireza Makhzani, Jonathon Shlens, Navdeep Jaitly, Ian Goodfellow, Brendan Frey
https://arxiv.org/abs/1511.05644

BiGAN

Short for Bidirectional GAN. This variant also learns the inverse mapping, that is, it projects data back into the latent space.
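One way to picture the bidirectional setup is that the discriminator no longer sees data alone but joint (data, latent) pairs. The NumPy sketch below assumes toy linear maps for the generator and encoder; the weights and sizes are hypothetical and nothing here is taken from the repository.

```python
import numpy as np

rng = np.random.default_rng(0)
data_dim, latent_dim, batch = 8, 3, 5             # hypothetical sizes
w_gen = rng.normal(size=(latent_dim, data_dim))   # toy linear generator G
w_enc = rng.normal(size=(data_dim, latent_dim))   # toy linear encoder E

def generator(z):
    return np.tanh(z @ w_gen)      # latent -> data

def encoder(x):
    return x @ w_enc               # data -> latent (the inverse direction)

x_real = rng.normal(size=(batch, data_dim))
z_prior = rng.normal(size=(batch, latent_dim))

# The discriminator judges joint (data, latent) pairs, not data alone.
real_pairs = np.concatenate([x_real, encoder(x_real)], axis=1)
fake_pairs = np.concatenate([generator(z_prior), z_prior], axis=1)
disc_inputs = np.concatenate([real_pairs, fake_pairs], axis=0)
disc_targets = np.concatenate([np.ones(batch), np.zeros(batch)])
```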

Code:

https://github.com/eriklindernoren/Keras-GAN/blob/master/bigan/bigan.py

Paper:

Adversarial Feature Learning
Jeff Donahue, Philipp Krähenbühl, Trevor Darrell
https://arxiv.org/abs/1605.09782

BGAN

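Although its abbreviation differs from the previous variant by only the letter i, the two GANs are entirely different. BGAN stands for boundary-seeking GAN.

The original GAN does not work with discrete data. BGAN uses the estimated difference measure from the discriminator to compute importance weights for generated samples, which provides a policy gradient for training the generator, so it can be trained on discrete data.

In BGAN, the importance weights of generated samples are closely tied to the discriminator's decision boundary, hence the name "boundary-seeking GAN".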
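As a rough illustration of those importance weights, the sketch below assumes the common formulation in which the discriminator output D(x) gives a density-ratio estimate D(x) / (1 - D(x)) that is then normalized over the batch. The function name and the clipping constant are assumptions made for this example.

```python
import numpy as np

def bgan_importance_weights(d_scores):
    """Normalized importance weights from discriminator outputs in (0, 1)."""
    d = np.clip(d_scores, 1e-7, 1 - 1e-7)   # avoid division by zero
    ratio = d / (1.0 - d)                   # estimated density ratio
    return ratio / ratio.sum()              # normalize over the samples

d_scores = np.array([0.1, 0.5, 0.9])        # hypothetical discriminator outputs
weights = bgan_importance_weights(d_scores)
```

Samples the discriminator rates as more realistic receive larger weights, and a sample scored at 0.5, right on the decision boundary, has an unnormalized ratio of exactly 1.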

Code:

https://github.com/eriklindernoren/Keras-GAN/blob/master/bgan/bgan.py

Paper:

Boundary-Seeking Generative Adversarial Networks
R Devon Hjelm, Athul Paul Jacob, Tong Che, Adam Trischler, Kyunghyun Cho, Yoshua Bengio
https://arxiv.org/abs/1702.08431

CC-GAN

This model is a semi-supervised learning method built on inpainting: the generator fills in the missing part of an image.
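The inpainting step can be sketched as a simple composite, assuming a binary mask that is 1 where pixels are known and 0 inside the hole. The constant "generated" array is a hypothetical stand-in for a generator's output.

```python
import numpy as np

def composite_inpainting(image, mask, generated):
    """Keep known pixels (mask == 1) and fill the hole with generated ones."""
    return mask * image + (1.0 - mask) * generated

image = np.arange(16, dtype=float).reshape(4, 4)
mask = np.ones((4, 4))
mask[1:3, 1:3] = 0.0                 # hide a 2x2 patch
generated = np.full((4, 4), -1.0)    # stand-in for a generator's output
result = composite_inpainting(image, mask, generated)
```

The composited image is what a discriminator would then judge as real or inpainted.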

Code:

https://github.com/eriklindernoren/Keras-GAN/blob/master/ccgan/ccgan.py

Paper:

Semi-Supervised Learning with Context-Conditional Generative Adversarial Networks
Emily Denton, Sam Gross, Rob Fergus
https://arxiv.org/abs/1611.06430

CGAN

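The conditional generative adversarial network (conditional GAN), in which both the generator and the discriminator are conditioned on some external information, such as class labels or data from another modality.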
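One common way to apply such a condition is to append a one-hot class label to the noise vector before it enters the generator. The sketch below shows only that input construction; the latent size of 100 and the helper names are assumptions, and the repository may wire the label in differently.

```python
import numpy as np

def one_hot(labels, num_classes):
    return np.eye(num_classes)[labels]

def conditional_input(noise, labels, num_classes):
    """Append a one-hot label to each noise vector."""
    return np.concatenate([noise, one_hot(labels, num_classes)], axis=1)

rng = np.random.default_rng(0)
noise = rng.normal(size=(3, 100))          # hypothetical latent size of 100
labels = np.array([7, 0, 3])               # the classes we want generated
gen_input = conditional_input(noise, labels, num_classes=10)
```

The discriminator would receive the same label alongside each image, so both networks see the condition.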

Code:

https://github.com/eriklindernoren/Keras-GAN/blob/master/cgan/cgan.py

Paper:

Conditional Generative Adversarial Nets
Mehdi Mirza, Simon Osindero
https://arxiv.org/abs/1411.1784

Context Encoder

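This is a convolutional neural network (CNN) for image inpainting that generates the contents of an arbitrary image region from the surrounding pixels.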
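A minimal sketch of the training signal, assuming a centered square hole (one special case of an arbitrary region) and a reconstruction error measured only inside that hole. The all-zero prediction is a hypothetical stand-in for the network output.

```python
import numpy as np

def center_mask(height, width, hole):
    """Return a mask that is 1 inside a centered hole x hole square."""
    mask = np.zeros((height, width))
    top, left = (height - hole) // 2, (width - hole) // 2
    mask[top:top + hole, left:left + hole] = 1.0
    return mask

def masked_l2_loss(target, prediction, mask):
    """Mean squared error over the hidden region only."""
    return float((mask * (target - prediction) ** 2).sum() / mask.sum())

target = np.ones((8, 8))
mask = center_mask(8, 8, hole=4)
context = target * (1.0 - mask)            # what the network gets to see
prediction = np.zeros((8, 8))              # stand-in for the network output
loss = masked_l2_loss(target, prediction, mask)
```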

Code:

https://github.com/eriklindernoren/Keras-GAN/blob/master/context_encoder/context_encoder.py

Paper:

Context Encoders: Feature Learning by Inpainting
Deepak Pathak, Philipp Krähenbühl, Jeff Donahue, Trevor Darrell, Alexei A. Efros
https://arxiv.org/abs/1604.07379

CoGAN

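The full name is coupled GANs (coupled generative adversarial networks). The model contains a pair of GANs and ties the weights of the first few layers of the two generators and of the last few layers of the two discriminators, which lets it learn a joint distribution over images from multiple domains.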
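Weight tying on the generator side can be sketched with two toy generators that share a first layer and differ only in their output layers. All sizes and weight matrices below are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)
latent_dim, hidden_dim, out_dim = 4, 6, 5    # hypothetical sizes

shared_w = rng.normal(size=(latent_dim, hidden_dim))  # tied first layer
head_a = rng.normal(size=(hidden_dim, out_dim))       # domain A output layer
head_b = rng.normal(size=(hidden_dim, out_dim))       # domain B output layer

def generate_pair(z):
    """Produce one sample per domain from the same latent vector."""
    hidden = np.maximum(0.0, z @ shared_w)   # shared ReLU layer
    return np.tanh(hidden @ head_a), np.tanh(hidden @ head_b)

z = rng.normal(size=(2, latent_dim))
img_a, img_b = generate_pair(z)
```

Because both outputs pass through the same early layer, an update to shared_w affects both domains at once.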

Code:

https://github.com/eriklindernoren/Keras-GAN/blob/master/cogan/cogan.py

Paper:

Coupled Generative Adversarial Networks
Ming-Yu Liu, Oncel Tuzel
https://arxiv.org/abs/1606.07536

CycleGAN

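This model comes out of research at the University of California, Berkeley, and performs image style transfer without paired training data.

Its example results are probably familiar to you.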
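What lets CycleGAN train without paired data is the cycle-consistency term named in the paper's title: translating an image to the other domain and back should return the original. The sketch below uses toy arithmetic functions as hypothetical stand-ins for the two generators.

```python
import numpy as np

def cycle_consistency_loss(x, y, g_xy, g_yx):
    """L1 error after translating to the other domain and back."""
    forward = np.abs(g_yx(g_xy(x)) - x).mean()    # X -> Y -> X
    backward = np.abs(g_xy(g_yx(y)) - y).mean()   # Y -> X -> Y
    return float(forward + backward)

# Toy "translators": hypothetical stand-ins for the two generators.
g_xy = lambda x: 2.0 * x + 1.0
g_yx_exact = lambda y: (y - 1.0) / 2.0     # true inverse of g_xy
g_yx_rough = lambda y: y / 2.0             # imperfect inverse

x = np.array([0.0, 1.0, 2.0])
y = np.array([1.0, 3.0, 5.0])
loss_exact = cycle_consistency_loss(x, y, g_xy, g_yx_exact)   # 0.0
loss_rough = cycle_consistency_loss(x, y, g_xy, g_yx_rough)   # 1.5
```

The loss is zero only when the two mappings undo each other.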

Code:

https://github.com/eriklindernoren/Keras-GAN/blob/master/cyclegan/cyclegan.py

Paper:

Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks
Jun-Yan Zhu, Taesung Park, Phillip Isola, Alexei A. Efros
https://arxiv.org/abs/1703.10593

The paper's authors have open-sourced Torch and PyTorch implementations; see the project page for details:

https://junyanz.github.io/CycleGAN/

DCGAN

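The deep convolutional generative adversarial network (DCGAN) was proposed as an approach to unsupervised learning, with the GAN serving as an alternative to maximum-likelihood techniques.

Code:

https://github.com/eriklindernoren/Keras-GAN/blob/master/dcgan/dcgan.py

Paper: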

Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks
Alec Radford, Luke Metz, Soumith Chintala
https://arxiv.org/abs/1511.06434

DualGAN

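This variant trains image translators from two sets of unlabeled images from different domains. The primal GAN in the architecture learns to translate images from domain U to domain V, while the dual GAN learns the reverse mapping, forming a closed loop.

Code:

https://github.com/eriklindernoren/Keras-GAN/blob/master/dualgan/dualgan.py

Paper: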

DualGAN: Unsupervised Dual Learning for Image-to-Image Translation
Zili Yi, Hao Zhang, Ping Tan, Minglun Gong
https://arxiv.org/abs/1704.02510

GAN

Yes, this is the original GAN from Ian Goodfellow.

Code:

https://github.com/eriklindernoren/Keras-GAN/blob/master/gan/gan.py

Paper:

Generative Adversarial Networks
Ian J. Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, Yoshua Bengio
https://arxiv.org/abs/1406.2661

InfoGAN

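This variant is an information-theoretic extension of the GAN that learns disentangled representations in a completely unsupervised manner. On the MNIST dataset, for example, InfoGAN successfully disentangles writing style from digit shape.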
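A hedged sketch of the two ingredients usually associated with this idea: a structured latent code concatenated with plain noise, and a cross-entropy term that rewards an auxiliary network for recovering the code from the generated sample. The sizes, function names, and the "Q network" outputs below are assumptions made for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)
batch, noise_dim, num_codes = 4, 62, 10     # hypothetical sizes

def sample_generator_input():
    """Concatenate unstructured noise with a one-hot categorical code."""
    noise = rng.normal(size=(batch, noise_dim))
    code = np.eye(num_codes)[rng.integers(0, num_codes, size=batch)]
    return np.concatenate([noise, code], axis=1), code

def code_cross_entropy(code, q_probs):
    """Cross-entropy between the sampled code and the Q network's guess.

    Minimizing it raises a lower bound on the mutual information
    between the code and the generated sample.
    """
    return float(-(code * np.log(q_probs + 1e-8)).sum(axis=1).mean())

gen_input, code = sample_generator_input()
uniform_guess = np.full((batch, num_codes), 1.0 / num_codes)
loss_uninformed = code_cross_entropy(code, uniform_guess)   # close to log(10)
loss_perfect = code_cross_entropy(code, code)               # close to 0
```

A guess that ignores the image pays about log(10) per sample, while a perfect recovery of the code pays almost nothing, which is the pressure that ties each code value to a visible factor such as digit identity.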

Code:

https://github.com/eriklindernoren/Keras-GAN/blob/master/infogan/infogan.py

Paper:

InfoGAN: Interpretable Representation Learning by Information Maximizing Generative Adversarial Nets
Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya Sutskever, Pieter Abbeel
https://arxiv.org/abs/1606.03657

LSGAN

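The Least Squares GAN (LSGAN) was proposed to address the vanishing-gradient problem in unsupervised GAN training; it uses a least-squares loss function for the discriminator.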
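The least-squares objective can be sketched as follows, assuming the 0/1 target coding (generated samples toward 0, real samples toward 1, and the generator aiming for 1). The score arrays are made-up numbers, not outputs of a trained model.

```python
import numpy as np

def lsgan_discriminator_loss(d_real, d_fake):
    """Push scores for real samples to 1 and for generated samples to 0."""
    return float(0.5 * np.mean((d_real - 1.0) ** 2) + 0.5 * np.mean(d_fake ** 2))

def lsgan_generator_loss(d_fake):
    """Push scores for generated samples toward the 'real' target 1."""
    return float(0.5 * np.mean((d_fake - 1.0) ** 2))

d_real = np.array([0.9, 1.1])     # raw discriminator scores, no sigmoid
d_fake = np.array([0.0, 0.2])
d_loss = lsgan_discriminator_loss(d_real, d_fake)   # about 0.015
g_loss = lsgan_generator_loss(d_fake)               # about 0.41
```

Unlike a saturating sigmoid cross-entropy, the squared error keeps penalizing a generated sample in proportion to how far its score sits from the target, so its gradient does not flatten out.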

Code:

https://github.com/eriklindernoren/Keras-GAN/blob/master/lsgan/lsgan.py

Paper:

Least Squares Generative Adversarial Networks
Xudong Mao, Qing Li, Haoran Xie, Raymond Y.K. Lau, Zhen Wang, Stephen Paul Smolley
https://arxiv.org/abs/1611.04076

Pix2Pix

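Most readers will know this model well. It comes from the same Berkeley team as CycleGAN and is an application of CGAN that uses an entire image as the condition.

Many colorization demos have been built on top of it, covering cats, faces, houses, handbags, comics, and more; some people have even used it to remove the mosaic from adult videos.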
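Conditioning on an entire image can be sketched as stacking the conditioning image and the candidate output along the channel axis before they reach the discriminator. The shapes and array names below are hypothetical, and the random arrays stand in for real images.

```python
import numpy as np

def discriminator_input(condition_image, candidate_image):
    """Stack the conditioning image and the candidate along the channel axis."""
    return np.concatenate([condition_image, candidate_image], axis=-1)

rng = np.random.default_rng(0)
edges = rng.random((1, 64, 64, 1))      # hypothetical grayscale sketch
photo = rng.random((1, 64, 64, 3))      # real or generated RGB output
pair = discriminator_input(edges, photo)
```

The discriminator then judges whether the photo is a plausible translation of that particular sketch, not merely whether it looks real.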

Code:

https://github.com/eriklindernoren/Keras-GAN/blob/master/pix2pix/pix2pix.py

Paper:

Image-to-Image Translation with Conditional Adversarial Networks
Phillip Isola, Jun-Yan Zhu, Tinghui Zhou, Alexei A. Efros
https://arxiv.org/abs/1611.07004

Pix2Pix currently has open-source Torch, PyTorch, TensorFlow, Chainer, and Keras models; see the project page for details:

https://phillipi.github.io/pix2pix/

SGAN

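The full name is straightforward: Semi-Supervised Generative Adversarial Network. It makes GAN training work in a semi-supervised setting by forcing the discriminator to output class labels.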
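A common way to have the discriminator output class labels is to give it one output per real class plus one extra "fake" class. The sketch below builds the matching training targets; the helper name and the choice of 10 classes are assumptions.

```python
import numpy as np

def sgan_targets(real_labels, num_fake, num_classes):
    """One-hot targets over num_classes real classes plus one 'fake' class."""
    fake_labels = np.full(num_fake, num_classes)   # index of the extra class
    labels = np.concatenate([real_labels, fake_labels])
    return np.eye(num_classes + 1)[labels]

real_labels = np.array([3, 8])      # class labels of two real images
targets = sgan_targets(real_labels, num_fake=2, num_classes=10)
```

Labeled real images use their true class, while generated images all map to the extra index.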

Code:

https://github.com/eriklindernoren/Keras-GAN/blob/master/sgan/sgan.py

Paper:

Semi-Supervised Learning with Generative Adversarial Networks
Augustus Odena
https://arxiv.org/abs/1606.01583

WGAN

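The full name is Wasserstein GAN. It uses the Wasserstein distance, also called the Earth-Mover distance, when learning the distribution. The new model improves the stability of learning, eliminates problems such as mode collapse, and provides learning curves that are meaningful for debugging and hyperparameter searches.

The WGAN implementation in the repository covered here uses the DCGAN generator and discriminator.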
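The Wasserstein objective itself can be sketched in a few lines, assuming the weight-clipping formulation with a clip value of 0.01. The scores and weights are made-up numbers, and the function names are hypothetical.

```python
import numpy as np

def critic_loss(scores_real, scores_fake):
    """Negative estimate of the Wasserstein distance; the critic minimizes it."""
    return float(np.mean(scores_fake) - np.mean(scores_real))

def clip_weights(weights, clip_value=0.01):
    """Keep the critic roughly Lipschitz by clipping every weight."""
    return [np.clip(w, -clip_value, clip_value) for w in weights]

scores_real = np.array([1.0, 3.0])      # unbounded critic scores, no sigmoid
scores_fake = np.array([-1.0, 0.0])
loss = critic_loss(scores_real, scores_fake)              # -2.5
clipped = clip_weights([np.array([0.5, -0.002, -3.0])])   # [0.01, -0.002, -0.01]
```

Because the critic's score gap tracks the distance between the two distributions, plotting this loss over training gives the kind of informative learning curve mentioned above.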

Code:

https://github.com/eriklindernoren/Keras-GAN/blob/master/wgan/wgan.py

Paper:

Wasserstein GAN
Martin Arjovsky, Soumith Chintala, Léon Bottou
https://arxiv.org/abs/1701.07875

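One final note: so that people without a GPU can also test these implementations, the author prefers dense layers and avoids convolutional layers whenever dense layers give reasonable results in a model.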
