Several deep learning algorithms have been developed to predict survival of cancer patients using whole slide images (WSIs).However, identification of image phenotypes within the WSIs that are relevant to patient survival and disease progression is difficult for both clinicians, and deep learning algorithms. Most deep learning based Multiple Instance Learning (MIL) algorithms for survival prediction use either top instances (e.g., maxpooling) or top/bottom instances (e.g., MesoNet) to identify image phenotypes. In this study, we hypothesize that wholistic information of the distribution of the patch scores within a WSI can predict the cancer survival better. We developed a distribution based multiple-instance survival learning algorithm (DeepDisMISL) to validate this hypothesis. We designed and executed experiments using two large international colorectal cancer WSIs datasets - MCO CRC and TCGA COAD-READ. Our results suggest that the more information about the distribution of the patch scores for a WSI, the better is the prediction performance. Including multiple neighborhood instances around each selected distribution location (e.g., percentiles) could further improve the prediction. DeepDisMISL demonstrated superior predictive ability compared to other recently published, state-of-the-art algorithms. Furthermore, our algorithm is interpretable and could assist in understanding the relationship between cancer morphological phenotypes and patients cancer survival risk.
翻译:已经开发了若干深层次的学习算法来预测癌症患者的生存情况,使用整个幻灯片图像(WSIIs)预测癌症患者的生存情况。然而,对于临床医生和深层次的学习算法来说,在与患者生存和疾病进展有关的WSI中,很难识别图像型的图象。大多数基于深层次学习的多实例学习算法(MIL)用于生存预测的计算法都使用顶级实例(例如,最大集合)或顶级/底级实例(例如,MesoNet)来识别图像的pheno类型。在这项研究中,我们假设的是,关于WSI内部补丁分分布的百分数的百分数信息能更好地预测癌症存活情况。我们开发了一种基于多种内分数生存学习算法(Dep Dismisl)来验证这一假设。我们设计并进行了两个大型国际彩色切除癌症的WSI数据集 - MCO CRC 和TCGACOAD-READ。我们的研究结果表明,关于WSI的可分配点分数类型分数的更多信息可能是进一步的预测性表现。包括每个选定分发地点的多个社区实例(e.MISSLL) 和最近显示的逻辑关系。