Supervised learning tasks such as cancer survival prediction from gigapixel whole slide images (WSIs) are a critical challenge in computational pathology that requires modeling complex features of the tumor microenvironment. These learning tasks are often solved with deep multi-instance learning (MIL) models that do not explicitly capture intratumoral heterogeneity. We develop a novel variance pooling architecture that enables a MIL model to incorporate intratumoral heterogeneity into its predictions. Two interpretability tools based on representative patches are illustrated to probe the biological signals captured by these models. An empirical study with 4,479 gigapixel WSIs from the Cancer Genome Atlas shows that adding variance pooling onto MIL frameworks improves survival prediction performance for five cancer types.
翻译:在计算病理学中,需要建模肿瘤微环境的复杂特征。这些学习任务往往通过深层多因子学习模型(MIL)来解决,这些模型没有明确捕捉地表内异质性。我们开发了一个新的差异组合结构,使MIL模型能够将地表内异质性纳入其预测中。用具有代表性的补丁演示了两个可解释工具,以探探探这些模型所捕捉的生物信号。由癌症基因组图集提供的4,479千兆瓦的实验研究显示,在MIL框架中增加差异集合可以提高五类癌症的存活预测性。