Intrinsic image decomposition (IID) is an under-constrained problem. Therefore, traditional approaches use hand crafted priors to constrain the problem. However, these constraints are limited when coping with complex scenes. Deep learning-based approaches learn these constraints implicitly through the data, but they often suffer from dataset biases (due to not being able to include all possible imaging conditions). In this paper, a combination of the two is proposed. Component specific priors like semantics and invariant features are exploited to obtain semantically and physically plausible reflectance transitions. These transitions are used to steer a progressive CNN with implicit homogeneity constraints to decompose reflectance and shading maps. An ablation study is conducted showing that the use of the proposed priors and progressive CNN increase the IID performance. State of the art performance on both our proposed dataset and the standard real-world IIW dataset shows the effectiveness of the proposed method. Code is made available at https://github.com/Morpheus3000/SIGNet
翻译:内在图像分解(IID)是一个受限制不足的问题。 因此,传统方法使用手工制作的前缀来遏制问题。 但是,这些限制在应对复杂场景时是有限的。深层次的学习方法通过数据隐含地了解这些限制,但往往受到数据集偏差的影响(因为无法包括所有可能的图像条件)。本文提出了两者的结合。诸如语义和变异特征等具体组成部分的前缀被利用以获得语义和物理上看似可信的反射过渡。这些过渡被用来引导一个带有隐含同质性限制的渐进式CNNCNN去拆解反射和阴影图。正在进行一项反动研究,表明使用拟议的前置和进步CNN提高了IID的性能。我们拟议的数据集和标准真实世界IIW数据集的艺术表现状况显示了拟议方法的有效性。代码见https://github.com/Morpheus 300/Signatetet。