Generative Adversarial Networks (GANs) were introduced by Goodfellow in 2014, and since then have become popular for constructing generative artificial intelligence models. However, the drawbacks of such networks are numerous, like their longer training times, their sensitivity to hyperparameter tuning, several types of loss and optimization functions and other difficulties like mode collapse. Current applications of GANs include generating photo-realistic human faces, animals and objects. However, I wanted to explore the artistic ability of GANs in more detail, by using existing models and learning from them. This dissertation covers the basics of neural networks and works its way up to the particular aspects of GANs, together with experimentation and modification of existing available models, from least complex to most. The intention is to see if state of the art GANs (specifically StyleGAN2) can generate album art covers and if it is possible to tailor them by genre. This was attempted by first familiarizing myself with 3 existing GANs architectures, including the state of the art StyleGAN2. The StyleGAN2 code was used to train a model with a dataset containing 80K album cover images, then used to style images by picking curated images and mixing their styles.
翻译:Goodfellow 于2014年引入了基因生成Adversarial网络(GANs),自此以后,这些网络在建立基因化人工智能模型方面变得很受欢迎。然而,这些网络的缺点很多,如训练时间较长、对超光度调整的敏感度、几类损失和优化功能,以及模式崩溃等其他困难等。GANs目前的应用包括产生摄影现实的人类面孔、动物和物体。然而,我想通过利用现有模型并从中学习,更详细地探索GANs的艺术能力。这一论文涵盖了神经网络的基础,并努力达到GANs的具体方面,同时从最复杂到最不复杂的现有模型的实验和修改。目的是看看GANs(具体地指SysteleGAN2)的状态是否能产生相册封面,如果有可能通过基因图案来调整它们。首先尝试熟悉现有的3个GANs结构,包括艺术StyleGAN2的状态。StyleGAN2代码用来用使用模型来制作模型,然后用使用含有80型相的图像的模版来制作。