Deep learning-based melanoma classification with dermoscopic images has recently shown great potential in automatic early-stage melanoma diagnosis. However, limited by the significant data imbalance and obvious extraneous artifacts, i.e., the hair and ruler markings, discriminative feature extraction from dermoscopic images is very challenging. In this study, we seek to resolve these problems respectively towards better representation learning for lesion features. Specifically, a GAN-based data augmentation (GDA) strategy is adapted to generate synthetic melanoma-positive images, in conjunction with the proposed implicit hair denoising (IHD) strategy. Wherein the hair-related representations are implicitly disentangled via an auxiliary classifier network and reversely sent to the melanoma-feature extraction backbone for better melanoma-specific representation learning. Furthermore, to train the IHD module, the hair noises are additionally labeled on the ISIC2020 dataset, making it the first large-scale dermoscopic dataset with annotation of hair-like artifacts. Extensive experiments demonstrate the superiority of the proposed framework as well as the effectiveness of each component. The improved dataset publicly avaliable at https://github.com/kirtsy/DermoscopicDataset.
翻译:最近,由于数据严重不平衡和明显的外在文物(即毛发和标尺标记),因此很难从脱moscopic图像中进行歧视性特征的提取,因此非常具有挑战性。在本研究中,我们力求分别解决这些问题,以更好地体现对腐蚀特征的描述性学习。具体地说,以GAN为基础的数据增强(GDA)战略经过调整,以生成合成色素阳性图像,并结合拟议的隐性头发脱色(IHD)战略。在与头发有关的表象中,通过辅助分类器网络隐含地分解,并反向发送到色素-纤维提取主干柱,以便进行更好的黑素特定表征学习。此外,为了培训IHD模块,头发噪音还被贴在ISIC2020数据集上,使它成为第一个大型的色素脱色素数据集,并附有发型工艺品的注释。广泛的实验展示了拟议框架的优越性,作为每部分的公开数据。