Recently, large, high-quality public datasets have led to the development of convolutional neural networks that can detect lymph node metastases of breast cancer at the level of expert pathologists. Many cancers, regardless of the site of origin, can metastasize to lymph nodes. However, collecting and annotating high-volume, high-quality datasets for every cancer type is challenging. In this paper we investigate how to leverage existing high-quality datasets most efficiently in multi-task settings for closely related tasks. Specifically, we will explore different training and domain adaptation strategies, including prevention of catastrophic forgetting, for colon and head-and-neck cancer metastasis detection in lymph nodes. Our results show state-of-the-art performance on both cancer metastasis detection tasks. Furthermore, we show the effectiveness of repeated adaptation of networks from one cancer type to another to obtain multi-task metastasis detection networks. Last, we show that leveraging existing high-quality datasets can significantly boost performance on new target tasks and that catastrophic forgetting can be effectively mitigated using regularization.
翻译:最近,大量高质量的公共数据集导致在专家病理学家一级开发了能够检测淋巴结结对乳腺癌转移情况的革命性神经网络。许多癌症,不论其发源地为何,都可转移到淋巴结。然而,收集和注注注每一类癌症的高容量、高质量数据集具有挑战性。在本文件中,我们调查了如何在多任务环境中最有效地利用现有的高质量数据集执行密切相关的任务。具体地说,我们将探索不同的培训和领域适应战略,包括防止灾难性的遗忘,在淋巴结点进行结肠和头颈癌转移的检测。我们的结果显示了两种癌症转移检测任务的最新表现。此外,我们显示了从一种癌症到另一种癌症的反复调整网络以获得多任务分布检测网络的有效性。最后,我们表明利用现有的高质量数据集可以极大地提高新目标任务的业绩,而灾难性的遗忘可以通过正规化来有效减轻。