A subset of machine learning research intersects with societal issues, including fairness, accountability and transparency, as well as the use of machine learning for social good. In this work, we analyze the scholars contributing to this research at the intersection of machine learning and society through the lens of the sociology of science. By analyzing the authorship of all machine learning papers posted to arXiv, we show that compared to researchers from overrepresented backgrounds (defined by gender and race/ethnicity), researchers from underrepresented backgrounds are more likely to conduct research at this intersection than other kinds of machine learning research. This state of affairs leads to contention between two perspectives on insiders and outsiders in the scientific enterprise: outsiders being those outside the group being studied, and outsiders being those who have not participated as researchers in an area historically. This contention manifests as an epistemic question on the validity of knowledge derived from lived experience in machine learning research, and predicts boundary work that we see in a real-world example.
翻译:一系列机器学习研究与社会问题交织在一起,包括公平、问责制和透明度,以及利用机器学习促进社会福利。在这项工作中,我们通过科学的社会学透镜分析在机器学习和社会交汇处为这项研究作出贡献的学者。通过分析向ArXiv张贴的所有机器学习论文的作者,我们表明,与来自代表比例过高背景(由性别和种族/族裔定义)的研究人员相比,代表性不足背景的研究人员比其他类型的机器学习研究更可能在这个交叉处进行研究。这种事态导致科学企业内部人和外部人两种观点之间的争论:外部人是被研究的群体以外的人,外部人是那些没有作为历史领域的研究人员参与的人。这一争论表明,从机器学习研究的活生生经验中获取的知识的有效性是一个教义问题,并预测我们在现实世界中看到的界限工作。