Problem: Detecting COVID-19 from chest X-Ray (CXR) images has become one of the fastest and easiest methods for detecting COVID-19. However, the existing methods usually use supervised transfer learning from natural images as a pretraining process. These methods do not consider the unique features of COVID-19 and the similar features between COVID-19 and other pneumonia. Aim: In this paper, we want to design a novel high-accuracy COVID-19 detection method that uses CXR images, which can consider the unique features of COVID-19 and the similar features between COVID-19 and other pneumonia. Methods: Our method consists of two phases. One is self-supervised learning-based pertaining; the other is batch knowledge ensembling-based fine-tuning. Self-supervised learning-based pretraining can learn distinguished representations from CXR images without manually annotated labels. On the other hand, batch knowledge ensembling-based fine-tuning can utilize category knowledge of images in a batch according to their visual feature similarities to improve detection performance. Unlike our previous implementation, we introduce batch knowledge ensembling into the fine-tuning phase, reducing the memory used in self-supervised learning and improving COVID-19 detection accuracy. Results: On two public COVID-19 CXR datasets, namely, a large dataset and an unbalanced dataset, our method exhibited promising COVID-19 detection performance. Our method maintains high detection accuracy even when annotated CXR training images are reduced significantly (e.g., using only 10% of the original dataset). In addition, our method is insensitive to changes in hyperparameters.
翻译:通过自监督学习和批次知识整合提高自动COVID-19检测的性能
问题:从胸部X光(CXR)图像中检测COVID-19已成为最快和最简便的检测COVID-19的方法之一。然而,现有方法通常使用自然图像的监督式迁移学习作为预训练过程。这些方法没有考虑COVID-19的独特特征和COVID-19与其他肺炎之间的相似特征。目的:在本文中,我们希望设计一种使用CXR图像的新型高精度COVID-19检测方法,该方法可以考虑COVID-19的独特特征和COVID-19与其他肺炎之间的相似特征。方法:我们的方法包括两个阶段。一个是基于自监督学习的预训练;另一个是基于批次知识整合的微调。基于自监督学习的预训练可以在不需要人工标注标签的情况下,从CXR图像中学习区分表示。另一方面,基于批次知识整合的微调可以根据它们的视觉特征相似性利用批次中图像的类别知识来提高检测性能。与我们先前的实现不同的是,我们在微调阶段引入了批次知识整合,减少了自监督学习中使用的内存,提高了COVID-19检测精度。结果:在两个公共COVID-19 CXR数据集上,即一个大型数据集和一个不平衡数据集上,我们的方法表现出了有前途的COVID-19检测性能。即使使用原始数据集的仅10%进行注释的CXR训练图像,我们的方法仍然保持高检测精度。此外,我们的方法对超参数的变化不敏感。