Deep learning models can enable accurate and efficient disease diagnosis, but have thus far been hampered by the data scarcity present in the medical world. Automated diagnosis studies have been constrained by underpowered single-center datasets, and although some results have shown promise, their generalizability to other institutions remains questionable as the data heterogeneity between institutions is not taken into account. By allowing models to be trained in a distributed manner that preserves patients' privacy, federated learning promises to alleviate these issues, by enabling diligent multi-center studies. We present the first federated learning study on the modality of cardiovascular magnetic resonance (CMR) and use four centers derived from subsets of the M\&M and ACDC datasets, focusing on the diagnosis of hypertrophic cardiomyopathy (HCM). We adapt a 3D-CNN network pretrained on action recognition and explore two different ways of incorporating shape prior information to the model, and four different data augmentation set-ups, systematically analyzing their impact on the different collaborative learning choices. We show that despite the small size of data (180 subjects derived from four centers), the privacy preserving federated learning achieves promising results that are competitive with traditional centralized learning. We further find that federatively trained models exhibit increased robustness and are more sensitive to domain shift effects.
翻译:深层次学习模式可以进行准确有效的疾病诊断,但迄今为止还受到医学界目前数据短缺的阻碍。自动化诊断研究受到动力不足的单一中心数据集的制约,尽管有些结果显示有希望,但是由于各机构之间数据差异性没有被考虑进去,这些模型对其他机构的普遍适用性仍然有疑问。我们通过允许以分散方式培训模型,保护病人隐私,联合学习承诺通过认真的多中心研究来缓解这些问题。我们介绍了关于心血管磁共振模式的第一次联合学习研究,并使用M ⁇ M和ACDC数据集的4个中心,重点是对超营养性心血管心血管心血管病病(HCM)的诊断。我们调整了3D-CNN网络,先于行动识别,并探索了将形状信息纳入模型的两种不同方法,以及四个不同的数据增强组合,系统分析其对不同协作学习选择的影响。我们发现,尽管数据规模小(180个),但使用M ⁇ M和ACDC数据集的4个中心数据集,侧重于诊断超营养性心血管疗法。我们经过培训的中央空间学习更富强力的模型取得了更有希望的成果。