Under-representation of certain populations, based on gender, race/ethnicity, and age, in data collection for predictive modeling may yield less-accurate predictions for the under-represented groups. Recently, this issue of fairness in predictions has attracted significant attention, as data-driven models are increasingly utilized to perform crucial decision-making tasks. Methods to achieve fairness in the machine learning literature typically build a single prediction model subject to some fairness criteria in a manner that encourages fair prediction performances for all groups. These approaches have two major limitations: i) fairness is often achieved by compromising accuracy for some groups; ii) the underlying relationship between dependent and independent variables may not be the same across groups. We propose a Joint Fairness Model (JFM) approach for binary outcomes that estimates group-specific classifiers using a joint modeling objective function that incorporates fairness criteria for prediction. We introduce an Accelerated Smoothing Proximal Gradient Algorithm to solve the convex objective function, and demonstrate the properties of the proposed JFM estimates. Next, we presented the key asymptotic properties for the JFM parameter estimates. We examined the efficacy of the JFM approach in achieving prediction performances and parities, in comparison with the Single Fairness Model, group-separate model, and group-ignorant model through extensive simulations. Finally, we demonstrated the utility of the JFM method in the motivating example to obtain fair risk predictions for under-represented older patients diagnosed with coronavirus disease 2019 (COVID-19).
翻译:根据性别、种族/族裔和年龄,在为预测性模型收集数据时,某些人口代表人数不足,根据性别、种族/族裔和年龄,某些群体在性别、种族/族裔和年龄方面的代表性不足,可能会对代表性不足的群体作出不准确的预测。最近,预测中的公平问题引起人们的极大关注,因为数据驱动模型越来越多地被用于执行关键的决策任务。在机器学习文献中实现公平的方法通常会建立一个单一的预测模型,但需遵循某种公平标准,从而鼓励所有群体获得公平的预测业绩。这些方法有两个主要限制:(1) 公平常常通过损害某些群体的准确性来实现;(2) 依附性和独立变数之间的根本关系可能不同群体之间不同。我们提出了一种双轨结果的联合公平模型(JFM)方法,即利用包含公平性预测标准的联合模型,估算特定群体分类人员。我们采用了一种加速的平稳度分级模型,用以解决共性目标功能,并展示了拟议JFMFM估计数的特性。 我们介绍了JFM模型中的主要特征特征特性,在对JFMA-FM方法进行广泛的预测中,我们通过常规性模型对20级模型进行了广泛的预测。