Coronavirus (COVID-19) has shown an unprecedented global crisis by the detrimental effect on the global economy and health. The number of COVID-19 cases has been rapidly increasing, and there is no sign of stopping. It leads to a severe shortage of test kits and accurate detection models. A recent study demonstrated that the chest X-ray radiography outperformed laboratory testing in COVID-19 detection. Therefore, using chest X-ray radiography analysis can help to screen suspected COVID-19 cases at an early stage. Moreover, the patient data is sensitive, and it must be protected to avoid revealing through model updates and reconstruction from the malicious attacker. In this paper, we present a privacy-preserving Federated Learning system for COVID-19 detection based on chest X-ray images. First, a Federated Learning system is constructed from chest X-ray images. The main idea is to build a decentralized model across multiple hospitals without sharing data among hospitals. Second, we first show that the accuracy of Federated Learning for COVID-19 identification reduces significantly for Non-IID data. We then propose a strategy to improve model's accuracy on Non-IID COVID-19 data by increasing the total number of clients, parallelism (client fraction), and computation per client. Finally, we apply a Differential Privacy Stochastic Gradient Descent (DP-SGD) to enhance the preserving of patient data privacy for our Federated Learning model. A strategy is also proposed to keep the robustness of Federated Learning to ensure the security and accuracy of the model.
翻译:科罗纳病毒(科罗纳病毒(COVID-19)通过对全球经济和健康造成有害影响,显示了前所未有的全球危机。科罗纳病毒(COVID-19)的病例数量迅速增加,没有停止的迹象。它导致测试工具包和准确检测模型严重短缺。最近的一项研究表明,胸X射线射线X射线X光射线比COVID-19检测的实验室测试效果好。因此,利用胸X射线射线射线分析可以帮助在早期阶段筛查疑为COVID-19的病例。此外,病人数据是敏感的,必须保护其联邦数据,以避免通过恶意袭击者的模型更新和重建披露。在本文件中,我们介绍了一个基于胸X光图像检测COVID-19的保密联邦学习系统。首先,联邦学习系统是用胸X射线图像构建的实验室测试测试结果。主要想法是,在多个医院建立分散模式,而没有在医院之间共享数据。第二,我们首先表明,为CVID-19的定期身份识别数据模型的准确性会大大降低非IID的数据。我们然后提出一个战略来改进CRED的模型的准确性,用于不断更新的A-II-CISLILD的客户数据库。通过SAL-CM-CM-ROD数据,我们不断更新的客户的升级的升级数据。