瓦塞斯斯坦空间扭曲-反对权衡的理论 (A Theory of the Distortion-Perception Tradeoff in Wasserstein Space)

The lower the distortion of an estimator, the more the distribution of its outputs generally deviates from the distribution of the signals it attempts to estimate. This phenomenon, known as the perception-distortion tradeoff, has captured significant attention in image restoration, where it implies that fidelity to ground truth images comes at the expense of perceptual quality (deviation from statistics of natural images). However, despite the increasing popularity of performing comparisons on the perception-distortion plane, there remains an important open question: what is the minimal distortion that can be achieved under a given perception constraint? In this paper, we derive a closed form expression for this distortion-perception (DP) function for the mean squared-error (MSE) distortion and the Wasserstein-2 perception index. We prove that the DP function is always quadratic, regardless of the underlying distribution. This stems from the fact that estimators on the DP curve form a geodesic in Wasserstein space. In the Gaussian setting, we further provide a closed form expression for such estimators. For general distributions, we show how these estimators can be constructed from the estimators at the two extremes of the tradeoff: The global MSE minimizer, and a minimizer of the MSE under a perfect perceptual quality constraint. The latter can be obtained as a stochastic transformation of the former.

翻译：测谎机的扭曲程度越低,其产出的分布通常就越偏离它所要估计的信号的分布。这一现象被称为感知扭曲取舍,在图像恢复方面引起极大关注,这意味着对地面真象的忠诚是以感知质量为代价的(自然图像统计数据的减少),然而,尽管对感知扭曲平面进行比较越来越受欢迎,但仍然存在一个重要的未决问题:在某种感知限制下可以达到的微小扭曲是什么?在本文中,我们为这种偏差-感知(DP)功能(MESE)的偏差和Wasserstein-2感知指数提供了一种封闭的形式表达方式。我们证明,尽管对真实图像的忠诚度以认知质量为代价,但不管基本分布如何,对真实性图像的忠诚性总是有偏差。这来自这一事实,即对DP曲线的估测点在瓦塞斯坦空间形成一种地理偏差。在Gaussabr 设置时,我们进一步为这种估测谎者提供了一种封闭的表达形式。对于一般分布而言,我们展示的是,这些最精确性前的精度,在后两部的精度,我们从SESA的精度之下可以确定一个最精确的精度。