Rare diseases are characterized by low prevalence and are often chronically debilitating or life-threatening. Imaging-based classification of rare diseases is challenging due to the severe shortage in training examples. Few-shot learning (FSL) methods tackle this challenge by extracting generalizable prior knowledge from a large base dataset of common diseases and normal controls, and transferring the knowledge to rare diseases. Yet, most existing methods require the base dataset to be labeled and do not make full use of the precious examples of the rare diseases. To this end, we propose in this work a novel hybrid approach to rare disease classification, featuring two key novelties targeted at the above drawbacks. First, we adopt the unsupervised representation learning (URL) based on self-supervising contrastive loss, whereby to eliminate the overhead in labeling the base dataset. Second, we integrate the URL with pseudo-label supervised classification for effective self-distillation of the knowledge about the rare diseases, composing a hybrid approach taking advantages of both unsupervised and (pseudo-) supervised learning on the base dataset. Experimental results on classification of rare skin lesions show that our hybrid approach substantially outperforms existing FSL methods (including those using fully supervised base dataset) for rare disease classification via effective integration of the URL and pseudo-label driven self-distillation, thus establishing a new state of the art.
翻译:稀有疾病的特点是流行率低,而且往往具有慢性衰弱性或生命威胁。由于培训实例严重短缺,基于成像的稀有疾病的分类具有挑战性。少见的学习(FSL)方法通过从庞大的普通疾病和正常控制的基数据库数据集中提取一般先前知识,并将知识转移到稀有疾病,来应对这一挑战。然而,大多数现有方法都要求将基础数据集贴上标签,而不是充分利用稀有疾病的宝贵例子。为此,我们建议对稀有疾病分类采用一种新的混合方法,以上述缺陷为对象,有两个关键的新颖之处。首先,我们采用基于自我监督对比损失的不受监督的代表学习(URL)方法,从而消除基础数据集标签方面的间接损失。第二,我们将URL与假标签监督的分类结合起来,以便有效地自我提炼稀有疾病知识,同时采用混合方法,同时利用未经强化的和(假的)监督的稀有疾病分类方法,在基础数据集中进行实验性的代表学习。通过监管的稀有皮肤标签法的自我升级方法,通过完全监管的混合的自我标签法显示我们现有的混合的混合病变压压式的自我标签方法。