The long-tailed distribution datasets poses great challenges for deep learning based classification models on how to handle the class imbalance problem. Existing solutions usually involve class-balacing strategies or transfer learing from head- to tail-classes or use two-stages learning strategy to re-train the classifier. However, the existing methods are difficult to solve the low quality problem when images are obtained by SAR. To address this problem, we establish a novel three-stages training strategy, which has excellent results for processing SAR image datasets with long-tailed distribution. Specifically, we divide training procedure into three stages. The first stage is to use all kinds of images for rough-training, so as to get the rough-training model with rich content. The second stage is to make the rough model learn the feature expression by using the residual dataset with the class 0 removed. The third stage is to fine tune the model using class-balanced datasets with all 10 classes (including the overall model fine tuning and classifier re-optimization). Through this new training strategy, we only use the information of SAR image dataset and the network model with very small parameters to achieve the top 1 accuracy of 22.34 in development phase.
翻译:长尾分发数据集对于如何处理阶级不平衡问题的深层次学习基于分类的分类模型提出了巨大挑战。现有解决方案通常包括类压战略或将各种图像从头类转向尾类,或者使用两阶段学习战略对分类器进行再培训。然而,当图像通过合成孔径雷达获得时,现有方法难以解决低质量问题。为解决这一问题,我们制定了一个新的三阶段培训战略,在以长尾分发方式处理合成孔径雷达图像数据集方面有极好的结果。具体地说,我们将培训程序分为三个阶段。第一阶段是使用各种图像进行粗糙培训,以便获得内容丰富的粗糙培训模式。第二阶段是使粗略模型通过使用零级的残余数据集来学习特征表达。第三阶段是用所有10级的平衡数据集调整模型(包括总体模型微调和分类再优化)。通过这一新的培训战略,我们只使用合成孔径雷达图像数据集的信息和网络模型,以最小的参数在1级中达到最小的精确度。