Measurement bridges theory and empirics. Without measures that appropriately capture theoretical concepts, description will fail to represent reality and true causal inference will be impossible. Yet, the social sciences traffic in complex concepts and their measurement is difficult. Item Response Theory (IRT) models reduce variation in multiple variables to continuous variation along one or more latent dimensions intended to capture key theoretical concepts. Unfortunately, those latent dimensions have no intrinsic conceptual meaning. Partial solutions to that problem include limiting the number of dimensions to one or assigning meaning post-analysis, but either can lead to potential bias and a lack of reliability across data sources. We propose, detail, and validate a semi-supervised approach employing Bayesian Item Response Theory on multiple latent dimensions and binary data. Our approach, which we validate on simulated and real data, yields conceptually meaningful latent dimensions that are reliable across different data sources without additional exogenous assumptions.
翻译:测量架桥理论和空洞。没有适当捕捉理论概念的措施,描述将无法代表现实和真正的因果推断。然而,复杂概念的社会科学流量及其测量是困难的。项目反应理论模型将多种变量的差异降低到一个或一个以上旨在捕捉关键理论概念的潜在维度的连续变化。不幸的是,这些潜在维度没有内在的概念意义。这个问题的部分解决办法包括将维度限制为一个维度,或指定分析后的含义,但两者都可能导致潜在的偏差和数据来源之间缺乏可靠性。我们提议、详细和验证一种半监督的方法,在多个潜在维度和二元数据方面采用巴伊西亚项目反应理论。我们用来验证模拟和实际数据的方法产生了具有概念意义的潜在维度,在不附加外在假设的情况下,在不同数据源之间具有可靠性。