Deep convolutional neural network (DCNN) models have been widely explored for skin disease diagnosis and some of them have achieved the diagnostic outcomes comparable or even superior to those of dermatologists. However, broad implementation of DCNN in skin disease detection is hindered by small size and data imbalance of the publically accessible skin lesion datasets. This paper proposes a novel single-model based strategy for classification of skin lesions on small and imbalanced datasets. First, various DCNNs are trained on different small and imbalanced datasets to verify that the models with moderate complexity outperform the larger models. Second, regularization DropOut and DropBlock are added to reduce overfitting and a Modified RandAugment augmentation strategy is proposed to deal with the defects of sample underrepresentation in the small dataset. Finally, a novel Multi-Weighted New Loss (MWNL) function and an end-to-end cumulative learning strategy (CLS) are introduced to overcome the challenge of uneven sample size and classification difficulty and to reduce the impact of abnormal samples on training. By combining Modified RandAugment, MWNL and CLS, our single DCNN model method achieved the classification accuracy comparable or superior to those of multiple ensembling models on different dermoscopic image datasets. Our study shows that this method is able to achieve a high classification performance at a low cost of computational resources and inference time, potentially suitable to implement in mobile devices for automated screening of skin lesions and many other malignancies in low resource settings.
翻译:为皮肤疾病诊断广泛探索了深相神经网络(DCNN)模型,其中一些模型取得了可比较甚至优于皮肤学家的诊断结果;然而,在皮肤疾病检测方面广泛实施DCNNN模型受到可公开获取的皮肤损伤数据集规模小和数据不平衡的阻碍;本文件提出了一个新的单一模型战略,用于对小型和不平衡数据集的皮肤损伤进行分类;首先,对各种小型和不平衡的数据集进行了培训,以核实中度复杂度小的模型比较大的模型要高得多;第二,将常规脱落和脱落模型添加到减少过度配制和修改的拉动增强战略,以应对小规模数据集中样本代表性不足的缺陷;最后,提出了新的多视新损失功能和端对端累积学习战略(CLS),以克服低度采样规模和分类难度不均匀的低度模型,并减少异常样本对培训的影响;第二,通过将调成的RandAug、MWNL和DBlock的常规增强战略,以应对小型数据集中样本的缺陷;最后,采用新的多度新《多度新新新新损失》(MWNL) 和高成本方法,在我们的多轨成本分析中实现了我们的多度成本分析中,在高成本中实现。