It has been demonstrated many times that the behavior of the human visual system is connected to the statistics of natural images. Since machine learning relies on the statistics of training data as well, the above connection has interesting implications when using perceptual distances (which mimic the behavior of the human visual system) as a loss function. In this paper, we aim to unravel the non-trivial relationship between the probability distribution of the data, perceptual distances, and unsupervised machine learning. To this end, we show that perceptual sensitivity is correlated with the probability of an image in its close neighborhood. We also explore the relation between distances induced by autoencoders and the probability distribution of the data used for training them, as well as how these induced distances are correlated with human perception. Finally, we discuss why perceptual distances might not lead to noticeable gains in performance over standard Euclidean distances in common image processing tasks except when data is scarce and the perceptual distance provides regularization.
翻译:已经多次证明人类视觉系统的行为与自然图像的统计相关。 由于机器学习也依赖于培训数据的统计, 上述连接在使用感知距离(模仿人类视觉系统的行为)作为损失函数时会产生有趣的影响。 在本文中, 我们的目标是解析数据概率分布、 感知距离和不受监督的机器学习之间的非三角关系。 为此, 我们显示感知敏感度与其附近图像的概率相关。 我们还探讨了自动编码器引起的距离与用于培训这些数据的数据的概率分布之间的关系, 以及这些感知距离与人类感知之间的关系。 最后, 我们讨论为什么感知距离可能不会导致在普通图像处理任务中, 在标准的欧几里德距离上取得显著的成绩, 除非数据稀少, 感知距离提供正规化 。