Prediction of survival in patients diagnosed with a brain tumour is challenging because of heterogeneous tumour behaviours and responses to treatment. Better estimates of prognosis would support treatment planning and patient care. Advances in machine learning have informed the development of clinical predictive models, but their integration into clinical practice is almost non-existent. One reason for this is the lack of interpretability of models. In this paper, we use a novel brain tumour dataset to compare two interpretable rule list models against popular machine learning approaches for brain tumour survival prediction. All models are quantitatively evaluated using standard performance metrics. The rule lists are also qualitatively assessed for their interpretability and clinical utility. The interpretability of the black box machine learning models is evaluated using two post-hoc explanation techniques, LIME and SHAP. Our results show that the rule lists were only slightly outperformed by the black box models. We demonstrate that the rule list algorithms produce simple decision lists that align with clinical expertise. By comparison, post-hoc interpretability methods applied to black box models may produce unreliable explanations of local model predictions. Model interpretability is essential for understanding differences in predictive performance and for integration into clinical practice.