We propose FedScore, a privacy-preserving federated learning framework for scoring system generation across multiple sites to facilitate cross-institutional collaborations. The FedScore framework includes five modules: federated variable ranking, federated variable transformation, federated score derivation, federated model selection and federated model evaluation. To illustrate usage and assess FedScore's performance, we built a hypothetical global scoring system for mortality prediction within 30 days after a visit to an emergency department using 10 simulated sites divided from a tertiary hospital in Singapore. We employed a pre-existing score generator to construct 10 local scoring systems independently at each site and we also developed a scoring system using centralized data for comparison. We compared the acquired FedScore model's performance with that of other scoring models using the receiver operating characteristic (ROC) analysis. The FedScore model achieved an average area under the curve (AUC) value of 0.763 across all sites, with a standard deviation (SD) of 0.020. We also calculated the average AUC values and SDs for each local model, and the FedScore model showed promising accuracy and stability with a high average AUC value which was closest to the one of the pooled model and SD which was lower than that of most local models. This study demonstrates that FedScore is a privacy-preserving scoring system generator with potentially good generalizability.
翻译:我们提出FedScore, 是一个保护隐私的联合会式学习框架,用于在多个地点建立评分系统,以促进跨机构合作。FedScore框架包括五个模块:联合变量排名、联合变量变换、联合分算、联合得分衍生、联合模式选择和联合模型评价。为了说明使用情况并评估FedScore的业绩,我们在访问一个紧急部门后30天内,利用新加坡一家三级医院的10个模拟地点,为死亡率预测建立了一个假设的全球评分系统。我们还使用一个原有的评分生成器,在每一个地点独立建立10个地方评分系统,我们还利用集中数据进行了比较,开发了一个评分系统。我们用接收器操作特性(ROC)分析,将获得的FedScore模型与其他评分模型的绩效进行了比较。FedScreco模型在所有地点均达到0.763的曲线值下的平均区域,标准偏差为0.020。我们还计算了每个地方模型的平均AUC值和SDFDCS平均评分模型的准确性和稳定性最接近于AS的亚值。</s>