Modern machine learning increasingly supports paradigms that are multi-institutional (using data from multiple institutions during training) or cross-institutional (using models from multiple institutions for inference), but the empirical effects of these paradigms are not well understood. This study investigates cross-institutional learning via an empirical case study in higher education. We propose a framework and metrics for assessing the utility and fairness of student dropout prediction models that are transferred across institutions. We examine the feasibility of cross-institutional transfer under real-world data- and model-sharing constraints, quantifying model biases for intersectional student identities, characterizing potential disparate impact due to these biases, and investigating the impact of various cross-institutional ensembling approaches on fairness and overall model performance. We perform this analysis on data representing over 200,000 enrolled students annually from four universities without sharing training data between institutions. We find that a simple zero-shot cross-institutional transfer procedure can achieve similar performance to locally-trained models for all institutions in our study, without sacrificing model fairness. We also find that stacked ensembling provides no additional benefits to overall performance or fairness compared to either a local model or the zero-shot transfer procedure we tested. We find no evidence of a fairness-accuracy tradeoff across dozens of models and transfer schemes evaluated. Our auditing procedure also highlights the importance of intersectional fairness analysis, revealing performance disparities at the intersection of sensitive identity groups that are concealed under one-dimensional analysis.
翻译:现代机器学习越来越支持跨机构的范例(在训练期间使用多个机构的数据)或跨机构的范例(在推断时使用多个机构的模型),但这些范例的实证影响尚不清楚。本研究通过高等教育的实证案例研究探讨了跨机构学习。我们提出了一个框架和评估指标,用于评估在机构间转移的学生退学预测模型的效用和公平性。我们研究了在实际的数据和模型共享约束下进行跨机构转移的可行性,并量化了交叉敏感身份标识下模型偏差的影响,表征了由于这些偏差而可能产生的不平等影响,并研究了各种跨机构组合方法对公平性和整体模型性能的影响。我们在代表每年有超过200,000名注册学生的四个大学的数据上进行了这项分析,而这四个机构之间没有共享训练数据。我们发现,一个简单的零射跨机构转移程序可以在我们的研究中为所有机构实现类似于本地训练模型的性能,而不会牺牲模型的公平性。我们还发现,与本地模型或我们测试的零射转移程序相比,堆叠组合方法没有提供任何有关整体性能或公平性的额外收益。我们在数十个评估过的模型和转移方案中没有发现公平-准确度权衡的证据。我们的审计程序还强调了交叉公平性分析的重要性,揭示了在一维分析下隐藏的敏感身份群体交叉地位的性能差异。