We systematically evaluate the performance of deep learning models in the presence of diseases not labeled for or present during training. First, we evaluate whether deep learning models trained on a subset of diseases (seen diseases) can detect the presence of any one of a larger set of diseases. We find that models tend to falsely classify diseases outside of the subset (unseen diseases) as "no disease". Second, we evaluate whether models trained on seen diseases can detect seen diseases when co-occurring with diseases outside the subset (unseen diseases). We find that models are still able to detect seen diseases even when co-occurring with unseen diseases. Third, we evaluate whether feature representations learned by models may be used to detect the presence of unseen diseases given a small labeled set of unseen diseases. We find that the penultimate layer of the deep neural network provides useful features for unseen disease detection. Our results can inform the safe clinical deployment of deep learning models trained on a non-exhaustive set of disease classes.
翻译:我们系统地评价深层次学习模式在培训期间未标明或存在的疾病方面的表现。首先,我们评估在一组疾病(见疾病)方面受过训练的深层次学习模式能否发现存在较大系列疾病中的任何一个。我们发现,模型往往错误地将子类以外的疾病(见不到疾病)归类为“无疾病”。第二,我们评估在与子类(见不到疾病)共同发生的疾病(见不到疾病)方面受过训练的关于已发现疾病的模型是否能够检测到已发现的疾病。我们发现,即使在与不可见疾病共同发生的情况下,模型仍然能够检测到已发现的疾病。第三,我们评估模型所学的特征表现是否可用于在少量的隐性疾病中发现未见疾病的存在。我们发现,深层神经网络的倒数层为不可见疾病检测提供了有用的特征。我们的结果可以证明,在临床安全部署经过非详尽的疾病分类培训的深层学习模式。