It has been demonstrated many times that the behavior of the human visual system is connected to the statistics of natural images. Since machine learning relies on the statistics of training data as well, the above connection has interesting implications when using perceptual distances (which mimic the behavior of the human visual system) as a loss function. In this paper, we aim to unravel the non-trivial relationships between the probability distribution of the data, perceptual distances, and unsupervised machine learning. To this end, we show that perceptual sensitivity is correlated with the probability of an image in its close neighborhood. We also explore the relation between distances induced by autoencoders and the probability distribution of the training data, as well as how these induced distances are correlated with human perception. Finally, we find perceptual distances do not always lead to noticeable gains in performance over Euclidean distance in common image processing tasks, except when data is scarce and the perceptual distance provides regularization. We propose this may be due to a \emph{double-counting} effect of the image statistics, once in the perceptual distance and once in the training procedure.
翻译:已经多次证明人类视觉系统的行为与自然图像的统计相关。 由于机器学习也依赖于培训数据的统计, 当使用感知距离(模仿人类视觉系统的行为)作为损失函数时,上述连接具有令人感兴趣的影响。 在本文中, 我们的目标是解析数据概率分布、 感知距离和不受监督的机器学习之间的非三角关系。 为此, 我们显示感知敏感性与其附近图像的概率相关。 我们还探讨了自动编码器引起的距离与培训数据的概率分布之间的关系, 以及这些感知距离如何与人类感知相关。 最后, 我们发现感知距离在普通图像处理任务中并非总能显著提高Euclide距离的性能, 除非数据稀少, 感知距离提供正规化。 我们提议, 这可能是由于图像统计在感知距离和一次培训过程中产生效果。