This paper presents a novel positive and negative set selection strategy for contrastive learning of medical images based on labels that can be extracted from clinical data. In the medical field, there exists a variety of labels for data that serve different purposes at different stages of a diagnostic and treatment process. Clinical labels and biomarker labels are two examples. In general, clinical labels are easier to obtain in larger quantities because they are regularly collected during routine clinical care, while biomarker labels require expert analysis and interpretation to obtain. Within the field of ophthalmology, previous work has shown that clinical values exhibit correlations with biomarker structures that manifest within optical coherence tomography (OCT) scans. We exploit this relationship between clinical and biomarker data to improve performance for biomarker classification. This is accomplished by leveraging the larger amount of clinical data as pseudo-labels for our data without biomarker labels in order to choose positive and negative instances for training a backbone network with a supervised contrastive loss. In this way, a backbone network learns a representation space that aligns with the clinical data distribution available. Afterwards, we fine-tune the network trained in this manner with the smaller amount of biomarker labeled data with a cross-entropy loss in order to classify these key indicators of disease directly from OCT scans. Our method is shown to outperform state of the art self-supervised methods by as much as 5% in terms of accuracy on individual biomarker detection.
翻译:本文为根据临床数据提取的标签对医学图像进行对比性学习,提出了一个新的积极和消极选择战略。在医学领域,存在用于诊断和治疗过程不同阶段不同目的的数据标签。临床标签和生物标志标签是两个例子。一般而言,临床标签更容易获得数量更多,因为它们在日常临床护理中定期收集,而生物标志标签需要专家分析和解释。在眼科领域,以前的工作表明临床价值显示与生物标志结构的相关性,这些结构在光学精确度成像(OCT)扫描中表现出来。我们利用临床和生物标志数据之间的关系来提高生物标志分类的性能。通过利用更多的临床数据作为没有生物标志标签的数据的假标签来获取这些标签,以便选择正面和负面的例子来培训有对比性损失的骨干网络。在眼科领域,骨干网络学习一个与临床数据分布相匹配的代表性空间,这种结构在光学与摄影学(OCT)扫描(OCT)扫描(OCT)扫描(OCI)扫描(OCI)中显示。随后,我们利用这些关键检测(O)网络中经过精细化的自我标记(O)的自我标记方法,这是我们所显示的生物标记的网络的自我标记方法,在生物标记(O)中显示的自我损失指标中显示的缩的比。