Foveated image reconstruction recovers full image from a sparse set of samples distributed according to the human visual system's retinal sensitivity that rapidly drops with eccentricity. Recently, the use of Generative Adversarial Networks was shown to be a promising solution for such a task as they can successfully hallucinate missing image information. Like for other supervised learning approaches, also for this one, the definition of the loss function and training strategy heavily influences the output quality. In this work, we pose the question of how to efficiently guide the training of foveated reconstruction techniques such that they are fully aware of the human visual system's capabilities and limitations, and therefore, reconstruct visually important image features. Due to the nature of GAN-based solutions, we concentrate on the human's sensitivity to hallucination for different input sample densities. We present new psychophysical experiments, a dataset, and a procedure for training foveated image reconstruction. The strategy provides flexibility to the generator network by penalizing only perceptually important deviations in the output. As a result, the method aims to preserve perceived image statistics rather than natural image statistics. We evaluate our strategy and compare it to alternative solutions using a newly trained objective metric and user experiments.
翻译:改造后的图像重建从根据人类视觉系统视网膜的视网膜灵敏度迅速下降而迅速下降的零散样本中恢复了全部图像。最近,利用基因反影网络被证明是解决这种任务的一个很有希望的办法,因为它们能够成功地将丢失的图像信息幻觉化。像其他受监督的学习方法一样,对于这一方法,损失功能和培训战略的定义也严重影响了产出质量。在这项工作中,我们提出了如何有效指导受变的重建技术培训的问题,以便他们充分意识到人类视觉系统的能力和局限性,因此,重建视像重要的图像特征。由于基于GAN的解决方案的性质,我们集中关注人类对不同输入样本密度的幻觉的敏感性。我们介绍了新的心理物理实验、数据集以及培养被变形图像重建的程序。该战略通过只惩罚产出中明显重要的偏差,为发电机网络提供了灵活性。结果是,这种方法旨在保存感知到的图像统计,而不是自然图像统计。我们通过新培训的用户实验来评估我们的战略,并将它与替代的解决方案进行比较。