It is generally believed that the human visual system is biased towards the recognition of shapes rather than textures. This assumption has led to a growing body of work aiming to align deep models' decision-making processes with the fundamental properties of human vision. The reliance on shape features is primarily expected to improve the robustness of these models under covariate shift. In this paper, we revisit the significance of shape-biases for the classification of skin lesion images. Our analysis shows that different skin lesion datasets exhibit varying biases towards individual image features. Interestingly, despite deep feature extractors being inclined towards learning entangled features for skin lesion classification, individual features can still be decoded from this entangled representation. This indicates that these features are still represented in the learnt embedding spaces of the models, but not used for classification. In addition, the spectral analysis of different datasets shows that in contrast to common visual recognition, dermoscopic skin lesion classification, by nature, is reliant on complex feature combinations beyond shape-bias. As a natural consequence, shifting away from the prevalent desire of shape-biasing models can even improve skin lesion classifiers in some cases.
翻译:一般认为,人类视觉系统偏向于对形状的认知,而不是对质谱的认知。这一假设导致越来越多的工作,旨在将深模型的决策过程与人类视觉的基本特性相协调。对形状特征的依赖主要预期会在共变变化的情况下提高这些模型的稳健性。在本文中,我们重新审视形状偏向对于皮肤损伤图像分类的意义。我们的分析表明,不同的皮肤皮肤病数据库对单个图像特征有不同的偏向。有趣的是,尽管深特征提取器倾向于学习皮肤损伤分类的纠缠特征,但个别特征仍然可以从这种缠绕的表示中解码。这表明,这些特征仍然体现在这些模型所学会的嵌入空间中,但并不用于分类。此外,对不同数据集的光谱分析表明,与常见的视觉识别相比,脱热分光层皮肤损害分类从自然角度看,依赖于超越形状和位谱的复杂特征组合。自然后果是,某些形状偏向普遍的形状偏移的模型中,甚至可以改善皮肤变形。