Cancer detection and classification from gigapixel whole slide images of stained tissue specimens has recently experienced enormous progress in computational histopathology. The limitation of available pixel-wise annotated scans shifted the focus from tumor localization to global slide-level classification on the basis of (weakly-supervised) multiple-instance learning despite the clinical importance of local cancer detection. However, the worse performance of these techniques in comparison to fully supervised methods has limited their usage until now for diagnostic interventions in domains of life-threatening diseases such as cancer. In this work, we put the focus back on tumor localization in form of a patch-level classification task and take up the setting of so-called coarse annotations, which provide greater training supervision while remaining feasible from a clinical standpoint. To this end, we present a novel ensemble method that not only significantly improves the detection accuracy of metastasis on the open CAMELYON16 data set of sentinel lymph nodes of breast cancer patients, but also considerably increases its robustness against noise while training on coarse annotations. Our experiments show that better results can be achieved with our technique making it clinically feasible to use for cancer diagnosis and opening a new avenue for translational and clinical research.
翻译:摘要:近年来,计算组织学领域的巨量整张组织切片染色图像中的癌症检测和分类取得了巨大进展。由于缺乏可用的像素级标注扫描,使得从肿瘤定位转向基于(弱监督的)多实例学习的全局级分类,尽管局部癌症检测的临床重要性。然而,这些技术与完全监督方法相比的性能更差,一直限制了它们在诸如癌症等致命疾病领域的诊断干预中的使用。在本研究中,我们重新将焦点放回到以一种基于修补程序级分类任务的肿瘤定位,并采用了所谓的粗略标注设置,这提供了更多的训练监督,并且从临床角度来看仍然是可行的。为此,我们提出了一种新的集成方法,它不仅显著提高了对女性乳癌患者哨兵淋巴结的开放式 CAMELYON16 数据集上转移的检测精度,而且还在训练粗略注释时显著增加了其鲁棒性。我们的实验表明,使用我们的技术可以实现更好的结果,从而使其在癌症诊断上具备临床可行性,并为转化和临床研究开辟了新途径。