Historically used in settings where the outcome is rare or data collection is expensive, outcome-dependent sampling is relevant to many modern settings where data is readily available for a biased sample of the target population, such as public administrative data. Under outcome-dependent sampling, common effect measures such as the average risk difference and the average risk ratio are not identified, but the conditional odds ratio is. Aggregation of the conditional odds ratio is challenging since summary measures are generally not identified. Furthermore, the marginal odds ratio can be larger (or smaller) than all conditional odds ratios. This so-called non-collapsibility of the odds ratio is avoidable if we use an alternative aggregation to the standard arithmetic mean. We provide a new definition of collapsibility that makes this choice of aggregation method explicit, and we demonstrate that the odds ratio is collapsible under geometric aggregation. We describe how to partially identify, estimate, and do inference on the geometric odds ratio under outcome-dependent sampling. Our proposed estimator is based on the efficient influence function and therefore has doubly robust-style properties.
翻译:历史上,在结果少见或数据收集费用昂贵的情况下,根据结果进行的抽样与许多现代环境有关,因为对目标人口有偏差的抽样可以随时获得数据,例如公共行政数据。在基于结果的抽样中,没有确定平均风险差异和平均风险比率等共同效果措施,但有条件的胜率比率是。由于一般没有确定简要措施,对有条件的胜率比率的汇总具有挑战性。此外,边际胜率比率可能大于(或小于)所有有条件的胜率比率。如果我们使用与标准算术平均值的替代组合,这种所谓的不易比率的不重叠性是可以避免的。我们提供了使这种组合方法选择明确化的共和性新定义,我们证明在几何组合下,概率比率是可相互重叠的。我们描述了在基于结果的抽样中如何部分确定、估计和判断几何差率比率。我们提议的估计依据的是高效的影响功能,因此具有双重稳健的特性。