Dermatological diagnosis automation is essential in addressing the high prevalence of skin diseases and critical shortage of dermatologists. Despite approaching expert-level diagnosis performance, convolutional neural network (ConvNet) adoption in clinical practice is impeded by their limited explainability, and by subjective, expensive explainability validations. We introduce DermX and DermX+, an end-to-end framework for explainable automated dermatological diagnosis. DermX is a clinically-inspired explainable dermatological diagnosis ConvNet, trained using DermXDB, a 554 image dataset annotated by eight dermatologists with diagnoses, supporting explanations, and explanation attention maps. DermX+ extends DermX with guided attention training for explanation attention maps. Both methods achieve near-expert diagnosis performance, with DermX, DermX+, and dermatologist F1 scores of 0.79, 0.79, and 0.87, respectively. We assess the explanation performance in terms of identification and localization by comparing model-selected with dermatologist-selected explanations, and gradient-weighted class-activation maps with dermatologist explanation maps, respectively. DermX obtained an identification F1 score of 0.77, while DermX+ obtained 0.79. The localization F1 score is 0.39 for DermX and 0.35 for DermX+. These results show that explainability does not necessarily come at the expense of predictive power, as our high-performance models provide expert-inspired explanations for their diagnoses without lowering their diagnosis performance.
翻译:皮肤病诊断自动化是解决皮肤疾病高发率和皮肤病学家严重短缺问题的关键。尽管接近专家一级诊断性能,临床实践采用进化神经网络(ConNet)仍受到临床实践的有限解释和主观、昂贵的解释性验证的阻碍。我们引入了DermX和DermX+,这是可解释自动皮肤病诊断的端到端框架。DermX是临床启发的可解释性皮肤病诊断ConvNet, 培训使用DermXDB, 由8名皮肤医生用诊断、支持解释和解释性诊断性关注性地图附加的554个图像数据集。DermX+扩展了DermX, 指导性关注性关注性研究地图。这两种方法都实现了接近专家性诊断性诊断性,分别使用DermX、DermX+和皮肤学家F1分数0.79和0.87。我们通过将选择的模型与皮肤病学家选择性解释性解释性解释性、分量级分类和分级分类图与皮肤X的分数,分别通过DermXxxx为0.7和0.3的成绩分分数。Drexxxxx,为0.15的成绩分分分解为0.7的确定性,用于0.15的确定性成本。