Along with confounding, selection bias is one of the fundamental threats to the validity of epidemiologic research. Unlike confounding, it has yet to be given a standard definition in terms of potential outcomes. Traditionally, selection bias has been defined as a systematic difference in a measure of the exposure-disease association in the study population and the population eligible for inclusion. This definition depends on the parameterization of the association between exposure and disease. The structural approach to selection bias defines selection bias as a spurious exposure-disease association within the study population that occurs when selection is a collider or a descendant of a collider on a causal path from exposure to disease in the eligible population. This definition covers only selection bias that can occur under the null hypothesis. Here, we propose a definition of selection bias in terms of potential outcomes that identifies selection bias whenever disease risks and exposure prevalences are distorted by the selection of study participants, not just a given measure of association (as in the traditional approach) or all measures of association (as in the structural approach). This definition is nonparametric, so it can be analyzed using causal graphs both under and away from the null. It unifies the theoretical frameworks used to understand selection bias and confounding, explicitly links selection to the estimation of causal effects, distinguishes clearly between internal and external validity, and simplifies the analysis of complex study designs.
翻译:选择偏好是影响流行病学研究有效性的根本威胁之一。与混淆不同,它尚未在潜在结果方面给出一个标准定义。传统上,选择偏好被定义为研究人口和有资格被接纳的人口在衡量接触-疾病关联方面的系统性差异。这一定义取决于接触与疾病关联的参数化。选择偏向的结构性方法将选择偏向定义为研究人口中一种虚假的接触-疾病关联。当选择是合格人口中疾病接触因果路径的串联者或串联者后裔时,选择偏向则不同。这一定义仅涵盖在无效假设下可能出现的选择偏向。这里,我们提议在潜在结果方面确定选择偏向,在选择研究参与者扭曲疾病风险和接触流行率时,确定选择偏向,而不仅仅是特定关联度(传统方法)或所有关联度(结构方法),这一定义是非定量的,因此可以使用因果关系图进行对比分析,从而明确理解选择结果的内在和外部选择结果之间的因果关系。我们提出了选择的理论框架,并明确区分了外部选择结果的准确性。