重新思考在以物理为基础的愿景中恢复形象的产生方法:从信息角度进行理论分析 (Rethinking Generative Methods for Image Restoration in Physics-based Vision: A Theoretical Analysis from the Perspective of Information)

End-to-end generative methods are considered a more promising solution for image restoration in physics-based vision compared with the traditional deconstructive methods based on handcrafted composition models. However, existing generative methods still have plenty of room for improvement in quantitative performance. More crucially, these methods are considered black boxes due to weak interpretability and there is rarely a theory trying to explain their mechanism and learning process. In this study, we try to re-interpret these generative methods for image restoration tasks using information theory. Different from conventional understanding, we analyzed the information flow of these methods and identified three sources of information (extracted high-level information, retained low-level information, and external information that is absent from the source inputs) are involved and optimized respectively in generating the restoration results. We further derived their learning behaviors, optimization objectives, and the corresponding information boundaries by extending the information bottleneck principle. Based on this theoretic framework, we found that many existing generative methods tend to be direct applications of the general models designed for conventional generation tasks, which may suffer from problems including over-invested abstraction processes, inherent details loss, and vanishing gradients or imbalance in training. We analyzed these issues with both intuitive and theoretical explanations and proved them with empirical evidence respectively. Ultimately, we proposed general solutions or ideas to address the above issue and validated these approaches with performance boosts on six datasets of three different image restoration tasks.

翻译：与基于手工制作的合成模型的传统解构方法相比,最终到最终的基因组方法被认为是物理学图像恢复的更有希望的解决方案。然而,现有的基因组方法在数量性表现方面仍有很大的改进空间。更重要的是,由于解释能力薄弱,这些方法被视为黑箱,很少有人试图解释其机制和学习过程。在这项研究中,我们试图用信息理论重新解释这些基因组方法,以完成图像恢复任务。不同于常规理解,我们分析了这些方法的信息流动,并确定了三种信息来源(高层次信息、保留低层次信息以及来源投入所缺乏的外部信息),在产生恢复结果时,它们分别涉及并优化了空间。我们进一步通过扩展信息瓶颈原则,从中推导出它们的学习行为、优化目标以及相应的信息界限。基于这一理论框架,我们发现许多现有的基因组方法倾向于直接应用为常规生成任务设计的一般模型,这三种方法可能受到问题的影响,包括过度将抽象过程、内在细节损失、以及来源投入所缺的外部信息),从而产生恢复结果。我们进一步推导出其学习行为行为,并用这些一般的理论推论或推论分析这些推论问题。