Creating large annotated datasets represents a major bottleneck for the development of deep learning models in radiology. To overcome this, we propose a combined use of weak labels (imprecise, but fast-to-create annotations) and Transfer Learning (TL). Specifically, we explore inductive TL, where source and target domains are identical, but tasks are different due to a label shift: our target labels are created manually by three radiologists, whereas the source weak labels are generated automatically from textual radiology reports. We frame knowledge transfer as hyperparameter optimization, thus avoiding heuristic choices that are frequent in related works. We investigate the relationship between model size and TL, comparing a low-capacity VGG with a higher-capacity SEResNeXt. The task that we address is change detection in follow-up glioma imaging: we extracted 1693 T2-weighted magnetic resonance imaging difference maps from 183 patients, and classified them into stable or unstable according to tumor evolution. Weak labeling allowed us to increase dataset size more than 3-fold, and improve VGG classification results from 75% to 82% Area Under the ROC Curve (AUC) (p=0.04). Mixed training from scratch led to higher performance than fine-tuning or feature extraction. To assess generalizability, we also ran inference on an open dataset (BraTS-2015: 15 patients, 51 difference maps), reaching up to 76% AUC. Overall, results suggest that medical imaging problems may benefit from smaller models and different TL strategies with respect to computer vision problems, and that report-generated weak labels are effective in improving model performances. Code, in-house dataset and BraTS labels are released.
翻译:创建大型附加说明的数据集是开发放射学深层学习模型的一大瓶颈。 为了克服这一障碍, 我们提议合并使用薄弱标签( 简化, 但快速到创建注释) 和传输学习( TL ) 。 具体地说, 我们探索导引 TL, 其源和目标域相同, 但任务因标签变化而不同 : 我们的目标标签是由三名放射学家手工创建的, 而来源薄弱标签则自动从文本上的放射系统报告中生成。 我们将知识转移作为超参数优化, 从而避免相关工作中经常出现的超光度选择 。 我们研究模型大小和 TL 之间的关系, 比较低容量 VGGH 和更高容量 SEResNeXt 。 我们处理的任务就是在后续性格成像中进行改变检测 : 我们从183 个病人中提取了1693 T2- 加权磁共振感应成成像图, 根据肿瘤的变异变, 将它们分类为稳定或不稳定 。 微量标签使我们得以将数据设置的大小超过3倍, 并改进VGS liglial lial dal dal dalation 。 (我们从75- dal dal dal deal deal deal deal disal deal deal deal deal deal disal disal disald) laction a der a der a der der) der deald to be be be a der deviolveald) der dealdations be der a der der der dealdatedated der device a der dece der device.