衡量和公平 (Measurement and Fairness)

We propose measurement modeling from the quantitative social sciences as a framework for understanding fairness in computational systems. Computational systems often involve unobservable theoretical constructs, such as socioeconomic status, teacher effectiveness, and risk of recidivism. Such constructs cannot be measured directly and must instead be inferred from measurements of observable properties (and other unobservable theoretical constructs) thought to be related to them -- i.e., operationalized via a measurement model. This process, which necessarily involves making assumptions, introduces the potential for mismatches between the theoretical understanding of the construct purported to be measured and its operationalization. We argue that many of the harms discussed in the literature on fairness in computational systems are direct results of such mismatches. We show how some of these harms could have been anticipated and, in some cases, mitigated if viewed through the lens of measurement modeling. To do this, we contribute fairness-oriented conceptualizations of construct reliability and construct validity that unite traditions from political science, education, and psychology and provide a set of tools for making explicit and testing assumptions about constructs and their operationalizations. We then turn to fairness itself, an essentially contested construct that has different theoretical understandings in different contexts. We argue that this contestedness underlies recent debates about fairness definitions: although these debates appear to be about different operationalizations, they are, in fact, debates about different theoretical understandings of fairness. We show how measurement modeling can provide a framework for getting to the core of these debates.

翻译：我们建议从社会科学数量上进行衡量,作为理解计算系统公平性的框架。计算系统通常涉及不易观察的理论结构,如社会经济地位、教师有效性和累犯风险。这种结构不能直接衡量,而必须从衡量认为与其相关的可观测属性(和其他不可观察的理论结构)的测量中推断出来,即通过衡量模型加以实施。这一过程必然涉及假设,在理论上理解所谓衡量的结构及其操作性之间有可能出现脱节。我们认为,计算系统公平性文献中讨论的许多损害都是这种不匹配的直接结果。我们表明,如何预见到这些损害,在某些情况下,如果从衡量模型的角度来看,这些损害是可以减轻的。为了做到这一点,我们促进以公平为导向的构筑可靠性概念,构建与政治科学、教育模式和心理学传统相结合的正确性,并提供一套工具,用于明确和测试关于构建及其操作性的假设。我们接着转向公平性,我们转向了这种不平等性文献本身。我们展示了一种基本上有争议的理论性辩论,但这种理论性辩论是不同的背景。