损失压缩通用速率扭曲-感官代表 (Universal Rate-Distortion-Perception Representations for Lossy Compression)

In the context of lossy compression, Blau & Michaeli (2019) adopt a mathematical notion of perceptual quality and define the information rate-distortion-perception function, generalizing the classical rate-distortion tradeoff. We consider the notion of universal representations in which one may fix an encoder and vary the decoder to achieve any point within a collection of distortion and perception constraints. We prove that the corresponding information-theoretic universal rate-distortion-perception function is operationally achievable in an approximate sense. Under MSE distortion, we show that the entire distortion-perception tradeoff of a Gaussian source can be achieved by a single encoder of the same rate asymptotically. We then characterize the achievable distortion-perception region for a fixed representation in the case of arbitrary distributions, identify conditions under which the aforementioned results continue to hold approximately, and study the case when the rate is not fixed in advance. This motivates the study of practical constructions that are approximately universal across the RDP tradeoff, thereby alleviating the need to design a new encoder for each objective. We provide experimental results on MNIST and SVHN suggesting that on image compression tasks, the operational tradeoffs achieved by machine learning models with a fixed encoder suffer only a small penalty when compared to their variable encoder counterparts.

翻译：在损失压缩的背景下,Blau & Michaeli (2019年)采用了一种概念性概念,即概念性质量,并定义信息率扭曲感知功能,对古典比率扭曲取舍加以概括;我们考虑了普遍表示的概念,即人们可以修复编码器,并改变解码器,以便在一系列扭曲和感知限制中达到任何点;我们证明,相应的信息理论性普遍率扭曲感知功能在实际操作上是可以实现的;在MSE 扭曲下,我们表明,高斯源的整个扭曲-感知取舍可以通过同一速度的单一编码器实现,而同一速度的单一编码器则可以实现。然后,我们将可实现的扭曲感知度区域描述为在任意分配情况下的固定代表,确定上述结果继续维持大致状态的条件,并在比率未提前固定的情况下研究有关案例。这激励着研究整个RDP交易中大致具有普遍性的实际构造,从而减轻了设计新的新编码交换器的需要,从而可以将每个目标的N源源源值与可变式模型进行比较,我们用SMIS的实验性结果将S-MIS的学习结果作为S-MIS的固定的实验结果。