Super-resolution (SR) is a fundamental and representative task of low-level vision area. It is generally thought that the features extracted from the SR network have no specific semantic information, and the network simply learns complex non-linear mappings from input to output. Can we find any "semantics" in SR networks? In this paper, we give affirmative answers to this question. By analyzing the feature representations with dimensionality reduction and visualization, we successfully discover the deep semantic representations in SR networks, \textit{i.e.}, deep degradation representations (DDR), which relate to the image degradation types and degrees. We also reveal the differences in representation semantics between classification and SR networks. Through extensive experiments and analysis, we draw a series of observations and conclusions, which are of great significance for future work, such as interpreting the intrinsic mechanisms of low-level CNN networks and developing new evaluation approaches for blind SR.
翻译:超分辨率(SR)是低水平视觉领域的一个基本和具有代表性的任务。一般认为,从SR网络中提取的特征没有具体的语义信息,而网络只是从输入到输出中学习复杂的非线性绘图。我们能否在SR网络中找到任何“语义学”?在本文中,我们对这一问题给出肯定答案。通过分析维度降低和可视化的特征表现,我们成功地发现了SR网络中与图像退化类型和程度有关的深层语义表达(DDR),我们还揭示了分类和SR网络在表达语义方面的差异。通过广泛的实验和分析,我们得出了一系列对未来工作具有重大意义的观察和结论,例如解释低级CNN网络的内在机制,并为盲人SR制定新的评价方法。