Reliable classification and detection of certain medical conditions, in images, with state-of-the-art semantic segmentation networks, require vast amounts of pixel-wise annotation. However, the public availability of such datasets is minimal. Therefore, semantic segmentation with image-level labels presents a promising alternative to this problem. Nevertheless, very few works have focused on evaluating this technique and its applicability to the medical sector. Due to their complexity and the small number of training examples in medical datasets, classifier-based weakly supervised networks like class activation maps (CAMs) struggle to extract useful information from them. However, most state-of-the-art approaches rely on them to achieve their improvements. Therefore, we propose a framework that can still utilize the low-quality CAM predictions of complicated datasets to improve the accuracy of our results. Our framework achieves that by first utilizing lower threshold CAMs to cover the target object with high certainty; second, by combining multiple low-threshold CAMs that even out their errors while highlighting the target object. We performed exhaustive experiments on the popular multi-modal BRATS and prostate DECATHLON segmentation challenge datasets. Using the proposed framework, we have demonstrated an improved dice score of up to 8% on BRATS and 6% on DECATHLON datasets compared to the previous state-of-the-art.
翻译:在图像中可靠地分类和检测某些医疗条件,在图像中,以最先进的语义分解网络为最先进的语义分解网络进行分类和检测,需要大量的像素分解图说明,然而,这类数据集的公开性很少。因此,用图像标签进行语义分解是解决这一问题的一个大有希望的替代办法。然而,很少有工作侧重于评价这一技术及其对医疗部门的适用性。由于这些技术的复杂性以及医疗数据集的培训实例数量很少,因此分类系统基础薄弱、监管薄弱的网络,如课堂启动图(CAMs),很难从中提取有用的信息。然而,大多数最先进的方法都依靠这些数据集来改进。因此,我们提议了一个框架,仍然能够利用对复杂数据集的低质量的CAM预测来提高我们结果的准确性。我们的框架通过首先使用较低门槛的CAM来非常肯定地覆盖目标对象;其次,将许多低门槛的CAMs(甚至排除了它们的目标目标目标目标目标目标)等网络进行整合。我们进行了彻底的实验,在广受欢迎的多式的多式RATS框架上,并展示了我们所展示了先前的BMOL数据,DTSA标准。</s>