Artificial intelligence (AI) models are increasingly used in the medical domain. However, as medical data is highly sensitive, special precautions to ensure the protection of said data are required. The gold standard for privacy preservation is the introduction of differential privacy (DP) to model training. However, prior work has shown that DP has negative implications on model accuracy and fairness. Therefore, the purpose of this study is to demonstrate that the privacy-preserving training of AI models for chest radiograph diagnosis is possible with high accuracy and fairness compared to non-private training. N=193,311 high quality clinical chest radiographs were retrospectively collected and manually labeled by experienced radiologists, who assigned one or more of the following diagnoses: cardiomegaly, congestion, pleural effusion, pneumonic infiltration and atelectasis, to each side (where applicable). The non-private AI models were compared with privacy-preserving (DP) models with respect to privacy-utility trade-offs (measured as area under the receiver-operator-characteristic curve (AUROC)), and privacy-fairness trade-offs (measured as Pearson-R or Statistical Parity Difference). The non-private AI model achieved an average AUROC score of 0.90 over all labels, whereas the DP AI model with a privacy budget of epsilon=7.89 resulted in an AUROC of 0.87, i.e., a mere 2.6% performance decrease compared to non-private training. The privacy-preserving training of diagnostic AI models can achieve high performance with a small penalty on model accuracy and does not amplify discrimination against age, sex or co-morbidity. We thus encourage practitioners to integrate state-of-the-art privacy-preserving techniques into medical AI model development.
翻译:人工智能(AI)模型越来越多地用于医疗领域。然而,由于医疗数据高度敏感,需要特别谨慎,以确保保护上述数据。隐私保护金标准是引入差异隐私(DP)模型培训。然而,先前的工作表明,人工智能模型对模型准确性和公平性有负面影响。因此,本研究的目的是表明,对人工智能模型进行隐私保护培训,与非私人培训相比,可以具有较高的准确性和公平性。 N=193,311 高质量的临床胸腔射电图由有经验的放射学家追溯收集并手工标注,他们分配了一个或多个以下诊断:心血管、拥堵、胸腔腐蚀、充气渗透和露天建筑。非私人人工智能模型与隐私模型(DP)模型比较,与隐私模型交换。根据接受者-观察者-国家特征曲线(AUROC)下的区域计量,以及隐私-公平性交易(Oral-al-al-al-al-al-al-al-Iral-deal-deal-deal-deal-deal-deal-Ial-Ial-Ial-Ial-deal-deal-Ial-Ial-deal-deal-deal-Ial-Ial-Ial-deal-deal-deal-deal ex a dislation ex a dal deal deal deal deal dal ex ex a dal ex ex ex ex ex ex exal ex ex exxxxxxxxxxxxxxxxxxxxxxx制制所有性能性能性能性能性能性能性能制,比,比,将所有性能,将所有Ial-deal-Ialalal-I-I)所有I-I-I-I-I-I-I-I-I-I-I-I-dealalalalalal-I-I-Ialalalalalalalalalalalalalalalalal-I-dealalalal exalal exal exalalalalalalal-I-dealalalalalalalalalalalalal性能制制制制制制制制制制制制制制