We study causal inference under case-control and case-population sampling. For this purpose, we focus on the binary-outcome and binary-treatment case, where the parameters of interest are causal relative and attributable risk defined via the potential outcome framework. It is shown that strong ignorability is not always as powerful as it is under random sampling and that certain monotonicity assumptions yield comparable results in terms of sharp identified intervals. Specifically, the usual odds ratio is shown to be a sharp identified upper bound on causal relative risk under the monotone treatment response and monotone treatment selection assumptions. We then discuss averaging the conditional (log) odds ratio and propose an algorithm for semiparametrically efficient estimation when averaging is based only on the (conditional) distributions of the covariates that are identified in the data. We also offer algorithms for causal inference if the true population distribution of the covariates is desirable for aggregation. We show the usefulness of our approach by studying two empirical examples from social sciences: the benefit of attending private school for entering a prestigious university in Pakistan and the causal relationship between staying in school and getting involved with drug-trafficking gangs in Brazil.
翻译:我们根据个案控制和个案人口抽样研究因果推断。为此目的,我们侧重于二进制和二进制处理案例,其利益参数是因果相对和根据潜在结果框架界定的可归因风险。我们发现,强烈忽视并不总是像随机抽样那样强大,某些单一性假设在明确间隔方面得出可比较的结果。具体地说,通常的胜率被显示为在单体治疗反应和单体治疗选择假设下对因果相对风险的高度确定。我们然后讨论条件(log)概率比率的平均值,并提议一种半对称有效估算算法,而平均值仅以数据中查明的共变体(有条件)分布为基础。我们还提供了一些算法,用以推断共变体的真正人口分布是否适宜于汇总。我们通过研究社会科学的两个经验实例来表明我们的方法的效用:在巴基斯坦进入一所有声望的私立学校的好处,以及留在学校与参与巴西贩毒团伙之间的因果关系。