Multiple instance learning (MIL) is a powerful tool to solve the weakly supervised classification in whole slide image (WSI) based pathology diagnosis. However, the current MIL methods are usually based on independent and identical distribution hypothesis, thus neglect the correlation among different instances. To address this problem, we proposed a new framework, called correlated MIL, and provided a proof for convergence. Based on this framework, we devised a Transformer based MIL (TransMIL), which explored both morphological and spatial information. The proposed TransMIL can effectively deal with unbalanced/balanced and binary/multiple classification with great visualization and interpretability. We conducted various experiments for three different computational pathology problems and achieved better performance and faster convergence compared with state-of-the-art methods. The test AUC for the binary tumor classification can be up to 93.09% over CAMELYON16 dataset. And the AUC over the cancer subtypes classification can be up to 96.03% and 98.82% over TCGA-NSCLC dataset and TCGA-RCC dataset, respectively.
翻译:多实例学习(MIL)是解决整个幻灯片图像基于病理学诊断中监管不力的分类的有力工具,然而,目前的MIL方法通常以独立和相同的分布假设为基础,从而忽视了不同实例之间的相互关系。为了解决这个问题,我们提出了一个新框架,称为关联的MIL, 并提供了趋同的证明。我们根据这个框架设计了一个基于变异器的MIL(TransMIL),它探索了形态和空间信息。拟议的 TransMIL可以有效地处理不平衡/平衡和二进制/多重分类,并且具有很高的可视化和可解释性。我们为三种不同的计算病理问题进行了各种实验,并取得了更好的性能和更快的趋同。用于二进肿瘤分类的AUC用于CAMELYON16数据集的测试可以高达93.09%。而癌症子类型分类的AUC可分别高达96.03%和98.82%的TCGA-NSCLC数据集和TCGA-RCC数据集。