In recent years, deep learning has been widely used in SAR ATR and achieved excellent performance on the MSTAR dataset. However, due to constrained imaging conditions, MSTAR has data biases such as background correlation, i.e., background clutter properties have a spurious correlation with target classes. Deep learning can overfit clutter to reduce training errors. Therefore, the degree of overfitting for clutter reflects the non-causality of deep learning in SAR ATR. Existing methods only qualitatively analyze this phenomenon. In this paper, we quantify the contributions of different regions to target recognition based on the Shapley value. The Shapley value of clutter measures the degree of overfitting. Moreover, we explain how data bias and model bias contribute to non-causality. Concisely, data bias leads to comparable signal-to-clutter ratios and clutter textures in training and test sets. And various model structures have different degrees of overfitting for these biases. The experimental results of various models under standard operating conditions on the MSTAR dataset support our conclusions. Our code is available at https://github.com/waterdisappear/Data-Bias-in-MSTAR.
翻译:近年来,深度学习在SAR目标识别中被广泛应用,并在MSTAR数据集上取得了出色的性能。然而,由于成像条件受限,MSTAR存在背景相关性等数据偏见,即背景杂波特性与目标类别有虚假相关性。深度学习可以过度适应杂波以减少训练误差。因此,过度适应杂波的程度反映了SAR目标识别中深度学习的非因果性。现有方法仅定性分析这种现象。在本文中,我们基于Shapley值量化不同区域对目标识别的贡献。 杂波的Shapley值测量过度拟合的程度。此外,我们解释了数据偏见和模型偏见对非因果性的贡献。简言之,数据偏见导致训练集和测试集具有可比较的信号杂波比和杂波纹理。不同的模型结构对这些偏差有不同程度的过度拟合。在MSTAR数据集上进行标准操作条件下的各种模型的实验结果支持我们的结论。我们的代码可在https://github.com/waterdisappear/Data-Bias-in-MSTAR中获得。