我们能不能为胸前X射线采用自我监督的训练前训练? (Can we Adopt Self-supervised Pretraining for Chest X-Rays?)

from arxiv, Extended Abstract presented at Machine Learning for Health (ML4H) symposium 2022, November 28th, 2022, New Orleans, United States & Virtual, http://www.ml4h.cc, 10 pages

Chest radiograph (or Chest X-Ray, CXR) is a popular medical imaging modality that is used by radiologists across the world to diagnose heart or lung conditions. Over the last decade, Convolutional Neural Networks (CNN), have seen success in identifying pathologies in CXR images. Typically, these CNNs are pretrained on the standard ImageNet classification task, but this assumes availability of large-scale annotated datasets. In this work, we analyze the utility of pretraining on unlabeled ImageNet or Chest X-Ray (CXR) datasets using various algorithms and in multiple settings. Some findings of our work include: (i) supervised training with labeled ImageNet learns strong representations that are hard to beat; (ii) self-supervised pretraining on ImageNet (~1M images) shows performance similar to self-supervised pretraining on a CXR dataset (~100K images); and (iii) the CNN trained on supervised ImageNet can be trained further with self-supervised CXR images leading to improvements, especially when the downstream dataset is on the order of a few thousand images.

翻译：切斯特射线仪(或Chest X-Ray, CXR)是一种流行的医疗成像模式,世界各地的放射学家都使用这种模式来诊断心脏或肺部状况。在过去的十年中,革命神经网络(CNN)在确定 CXR 图像中的病理方面取得了成功。这些CNN在标准图像网络分类任务上受过预先培训,但这假定有大规模附加说明的数据集。在这项工作中,我们分析了无标签图像网或Chest X-Ray (CXR) 数据集预培训的效用,使用各种算法和多种设置。我们工作的一些发现包括:(一) 与标签图像网络(CNN) 的监督下培训学会了难以击败的强烈表现;(二) 图像网络(~1M 图像) 自我监督前培训的性能与CXR 数据集(~ 100K 图像) 自我监控前培训相似;(三) 受监督的CNNM 受监督的图像网络可以进一步培训,使用几部自我监控的CXR 图像,在下游的顺序上可以改进。

相关内容

ImageNet (数据集)

关注 22

ImageNet项目是一个用于视觉对象识别软件研究的大型可视化数据库。超过1400万的图像URL被ImageNet手动注释，以指示图片中的对象;在至少一百万个图像中，还提供了边界框。ImageNet包含2万多个类别; [2]一个典型的类别，如“气球”或“草莓”，包含数百个图像。第三方图像URL的注释数据库可以直接从ImageNet免费获得;但是，实际的图像不属于ImageNet。自2010年以来，ImageNet项目每年举办一次软件比赛，即ImageNet大规模视觉识别挑战赛（ILSVRC），软件程序竞相正确分类检测物体和场景。 ImageNet挑战使用了一个“修剪”的1000个非重叠类的列表。2012年在解决ImageNet挑战方面取得了巨大的突破，被广泛认为是2010年的深度学习革命的开始。

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日