Chest radiography has been a recommended procedure for patient triaging and resource management in intensive care units (ICUs) throughout the COVID-19 pandemic. The machine learning efforts to augment this workflow have been long challenged due to deficiencies in reporting, model evaluation, and failure mode analysis. To address some of those shortcomings, we model radiological features with a human-interpretable class hierarchy that aligns with the radiological decision process. Also, we propose the use of a data-driven error analysis methodology to uncover the blind spots of our model, providing further transparency on its clinical utility. For example, our experiments show that model failures highly correlate with ICU imaging conditions and with the inherent difficulty in distinguishing certain types of radiological features. Also, our hierarchical interpretation and analysis facilitates the comparison with respect to radiologists' findings and inter-variability, which in return helps us to better assess the clinical applicability of models.
翻译:在整个COVID-19大流行期间,乳房X射线学一直是推荐在特护单位进行病人三角和资源管理的一种程序;由于报告、模型评估和故障模式分析方面的缺陷,扩大这种工作流程的机器学习工作长期以来一直受到挑战;为解决其中一些缺陷,我们模拟放射特征时采用了与辐射决策过程相一致的人类可解释等级等级的人类辐射特征;此外,我们提议使用数据驱动错误分析方法来发现我们模型的盲点,进一步提高其临床效用的透明度;例如,我们的实验表明,模型失败与综合护理成像条件和区分某些辐射特征的内在困难密切相关;此外,我们的等级解释和分析有助于比较放射学家的调查结果和可变性,这反过来有助于我们更好地评估模型的临床适用性。