Under the global pandemic of COVID-19, building an automated framework that quantifies the severity of COVID-19 and localizes the relevant lesion on chest X-ray images has become increasingly important. Although pixel-level lesion severity labels, e.g. lesion segmentation, can be the most excellent target to build a robust model, collecting enough data with such labels is difficult due to time and labor-intensive annotation tasks. Instead, array-based severity labeling that assigns integer scores on six subdivisions of lungs can be an alternative choice enabling the quick labeling. Several groups proposed deep learning algorithms that quantify the severity of COVID-19 using the array-based COVID-19 labels and localize the lesions with explainability maps. To further improve the accuracy and interpretability, here we propose a novel Vision Transformer tailored for both quantification of the severity and clinically applicable localization of the COVID-19 related lesions. Our model is trained in a weakly-supervised manner to generate the full probability maps from weak array-based labels. Furthermore, a novel progressive self-training method enables us to build a model with a small labeled dataset. The quantitative and qualitative analysis on the external testset demonstrates that our method shows comparable performance with radiologists for both tasks with stability in a real-world application.
翻译:在COVID-19这一全球大流行病下,建立一个自动框架,对COVID-19的严重程度进行量化,并对胸部X光图像的相关损伤进行本地化,这已变得日益重要。虽然像素水平的损害严重程度标签,例如,腐蚀分解,可以成为构建一个稳健模型的最优秀目标,但是由于时间和劳动密集型说明任务,难以收集足够数据,用这种标签收集足够数据。相反,在六个肺部子部分配整分数的基于阵列重力标签可能是促成快速标签的替代选择。一些团体提议采用深层次的学习算法,用基于阵列的COVID-19标签量化COVID-19的严重性,并将损伤与可解释性图进行本地化。为了进一步提高准确性和可解释性,我们在这里建议了一个新的视野变变换器,既能量化与临床适用的COVID-19相关损害的严重性,又适合临床适用的本地化。我们的模型经过薄弱的超常度训练,可以用来从薄弱的阵列标签上绘制完整的概率图。此外,一个具有新颖的自我进化的自我分析方法,可以使我们用一种可比较性的数据测试方法来构建一个模拟。