Early detection of breast cancer is a powerful tool towards decreasing its socioeconomic burden. Although, artificial intelligence (AI) methods have shown remarkable results towards this goal, their "black box" nature hinders their wide adoption in clinical practice. To address the need for AI guided breast cancer diagnosis, interpretability methods can be utilized. In this study, we used AI methods, i.e., Random Forests (RF), Neural Networks (NN) and Ensembles of Neural Networks (ENN), towards this goal and explained and optimized their performance through interpretability techniques, such as the Global Surrogate (GS) method, the Individual Conditional Expectation (ICE) plots and the Shapley values (SV). The Wisconsin Diagnostic Breast Cancer (WDBC) dataset of the open UCI repository was used for the training and evaluation of the AI algorithms. The best performance for breast cancer diagnosis was achieved by the proposed ENN (96.6% accuracy and 0.96 area under the ROC curve), and its predictions were explained by ICE plots, proving that its decisions were compliant with current medical knowledge and can be further utilized to gain new insights in the pathophysiological mechanisms of breast cancer. Feature selection based on features' importance according to the GS model improved the performance of the RF (leading the accuracy from 96.49% to 97.18% and the area under the ROC curve from 0.96 to 0.97) and feature selection based on features' importance according to SV improved the performance of the NN (leading the accuracy from 94.6% to 95.53% and the area under the ROC curve from 0.94 to 0.95). Compared to other approaches on the same dataset, our proposed models demonstrated state of the art performance while being interpretable.
翻译:早期检测乳腺癌是降低其社会经济负担的有力工具。虽然人工智能(AI)方法已经展示出实现这一目标的显著成果,但其“黑盒”性质阻碍了其在临床实践中的广泛采用。为了解决AI引导乳腺癌诊断的需要,可以使用可解释的方法。在本研究中,我们使用了AI方法,即随机森林(Rand Forest )、神经网络(NN)和神经网络集合(ENN),以实现这一目标,并通过可解释技术解释和优化其性能,如全球超闭门技术(GS),个人条件预期(ICE)图和夏普利值(SV)等。为了应对AI引导乳腺癌诊断诊断诊断分析的需要,可以使用开放 UCI 仓库的可解释性数据集(WDBC)用于培训和评估AI算法。拟议的ENNUN(96.6%的准确性能和0.96区域(ROC曲线下的0.56区域),其预测由ICE图解,证明它的决定符合当前医学数据(ICE)的准确性(ICE) 4 个人条件(ICE) 和精度(WC) 的精度(WC) 的精度(Oralalal 6) 6) 的精度指标,从目前的精度到新的精度(SL),可以进一步利用从SL 性选择的精度的精度的精度的精度,从目前的精度到不断的精度,从SB的精度,从SBA的精度到不断的精度的精度的精度的精度,从SB。