Purpose: To investigate chest radiograph (CXR) classification performance of vision transformers (ViT) and interpretability of attention-based saliency using the example of pneumothorax classification. Materials and Methods: In this retrospective study, ViTs were fine-tuned for lung disease classification using four public data sets: CheXpert, Chest X-Ray 14, MIMIC CXR, and VinBigData. Saliency maps were generated using transformer multimodal explainability and gradient-weighted class activation mapping (GradCAM). Classification performance was evaluated on the Chest X-Ray 14, VinBigData, and SIIM-ACR data sets using the area under the receiver operating characteristic curve analysis (AUC) and compared with convolutional neural networks (CNNs). The explainability methods were evaluated with positive/negative perturbation, sensitivity-n, effective heat ratio, intra-architecture repeatability and interarchitecture reproducibility. In the user study, three radiologists classified 160 CXRs with/without saliency maps for pneumothorax and rated their usefulness. Results: ViTs had comparable CXR classification AUCs compared with state-of-the-art CNNs 0.95 (95% CI: 0.943, 0.950) versus 0.83 (95%, CI 0.826, 0.842) on Chest X-Ray 14, 0.84 (95% CI: 0.769, 0.912) versus 0.83 (95% CI: 0.760, 0.895) on VinBigData, and 0.85 (95% CI: 0.847, 0.861) versus 0.87 (95% CI: 0.868, 0.882) on SIIM ACR. Both saliency map methods unveiled a strong bias toward pneumothorax tubes in the models. Radiologists found 47% of the attention-based saliency maps useful and 39% of GradCAM. The attention-based methods outperformed GradCAM on all metrics. Conclusion: ViTs performed similarly to CNNs in CXR classification, and their attention-based saliency maps were more useful to radiologists and outperformed GradCAM.
翻译:目的: 调查视觉变压器的胸透射( CXR) 分类性能( VIDC), 并使用肺炎球体的分类法来解读关注的显著性。 材料和方法 : 在本次回顾研究中, ViT使用四个公共数据集对肺病分类进行了精细调整: CheXpert, Chest X-Ray 14, MIMIC XR 14, 和 VNBC 。 使用变压器多式解析( GradCAM ) 和梯度级激活绘图( GradCAM ) 。 在Chest X Ray 14, VinBigData, SIIM-AC 数据组使用接收器运行曲线分析的区域( AUSC) 并使用脉冲电流分析( CheX) 。 以正/ 内振荡性透析器、 感应力、 有效的热率比率、 内电压重复性和分解( 用户研究中, 将160 CX 的CX 直径图分为160/无色图, 比较的CR8- 95 CISAL5) 。</s>