Open Source Software (OSS) projects are typically the result of collective efforts performed by developers with different backgrounds. Although the quality of developers' contributions should be the only factor influencing the evaluation of the contributions to OSS projects, recent studies have shown that diversity issues are correlated with the acceptance or rejection of developers' contributions. This paper assists this emerging state-of-the-art body on diversity research with the first empirical study that analyzes how developers' perceptible race and ethnicity relates to the evaluation of the contributions in OSS. We performed a large-scale quantitative study of OSS projects in GitHub. We extracted the developers' perceptible race and ethnicity from their names in GitHub using the Name-Prism tool and applied regression modeling of contributions (i.e, pull requests) data from GHTorrent and GitHub. We observed that among the developers whose perceptible race and ethnicity was captured by the tool, only 16.56% were perceptible as Non-White developers; contributions from perceptible White developers have about 6-10% higher odds of being accepted when compared to contributions from perceptible Non-White developers; and submitters with perceptible non-white races and ethnicities are more likely to get their pull requests accepted when the integrator is estimated to be from their same race and ethnicity rather than when the integrator is estimated to be White. Our initial analysis shows a low number of Non-White developers participating in OSS. Furthermore, the results from our regression analysis lead us to believe that there may exist differences between the evaluation of the contributions from different perceptible races and ethnicities. Thus, our findings reinforce the need for further studies on racial and ethnic diversity in software engineering to foster healthier OSS communities.
翻译:开放源码软件(OSS)项目通常是具有不同背景的开发者集体努力的结果。虽然开发者贡献的质量应该是影响对开放源码软件项目贡献评价的唯一因素,但最近的研究显示,多样性问题与接受或拒绝开发者贡献相关。本文协助这个新兴的多样化研究最先进的机构,而第一次经验研究分析开发者明显种族和族裔如何与对开放源码软件贡献的评价相关。我们对GitHub的开放源码软件项目进行了大规模的定量研究。我们利用名称-底限工具从GitHub的开发者名称中提取了可见的种族和族裔贡献的质量。我们发现,在从GHOHorrent和GitHub的数据中采用了回归模型(即,拉动请求)。在首次实验研究中,从显见的种族和族裔差异中可以看出,只有16.56%的人可以作为非白人开发者;在GitHeriflical White开发者对Gitricks 做出的贡献中,比在GitHigh-10 % 之间,我们从可察觉的种族-10 %。当我们无法了解的种族多样性分析时,我们最初的种族和种族多样性分析显示的种族的种族多样性的种族差异的种族多样性需要时,而提交者则从我们更有可能从我们的种族和种族-推导的种族-推介为非种族和种族-种族-种族-种族多样性研究。在初步的种族多样性研究是非种族-种族-种族和推算。在初步的推算中,在初步的推算出其推算出其推算为非种族和推算为非种族和推算。