The success of Generative Adversarial Networks (GANs) is largely built upon the adversarial training between a generator (G) and a discriminator (D). They are expected to reach a certain equilibrium where D cannot distinguish the generated images from the real ones. However, such an equilibrium is rarely achieved in practical GAN training, instead, D almost always surpasses G. We attribute one of its sources to the information asymmetry between D and G. We observe that D learns its own visual attention when determining whether an image is real or fake, but G has no explicit clue on which regions to focus on for a particular synthesis. To alleviate the issue of D dominating the competition in GANs, we aim to raise the spatial awareness of G. Randomly sampled multi-level heatmaps are encoded into the intermediate layers of G as an inductive bias. Thus G can purposefully improve the synthesis of certain image regions. We further propose to align the spatial awareness of G with the attention map induced from D. Through this way we effectively lessen the information gap between D and G. Extensive results show that our method pushes the two-player game in GANs closer to the equilibrium, leading to a better synthesis performance. As a byproduct, the introduced spatial awareness facilitates interactive editing over the output synthesis. Demo video and code are available at https://genforce.github.io/eqgan-sa/.
翻译:Generation Adversarial Networks(GANs)的成功在很大程度上建立在发电机(GANs)和导师(D)之间的对抗性培训的基础上。预计它们将达到一定的平衡,D无法将产生的图像与真实图像区分开来。然而,在实际的GAN培训中很少实现这种平衡,D几乎总是超过G。我们将其来源之一归结于D和G之间的信息不对称。我们观察到D在确定一个图像是真实还是假的时,了解自己的视觉关注,但G没有明确的线索说明哪些区域需要关注某一特定合成。为了缓解D主宰GANs的竞争问题,我们的目标是提高G.随机抽样多层次的热映像仪的空间意识,将其编码成GAN的中间层,作为感性偏差。因此GG可以有意地改进某些图像区域的合成。我们进一步建议将G的空间意识与D引出的注意地图相匹配。我们通过这种方式有效地缩小了D和G. 广泛的结果显示我们的方法在GANs的交互性游戏中将Aslapeal-cha dal amal dalizald ex destalization views views by lagiew lavel.