Estimating the effects of continuous-valued interventions from observational data is a critically important task for climate science, healthcare, and economics. Recent work focuses on designing neural network architectures and regularization functions to allow for scalable estimation of average and individual-level dose-response curves from high-dimensional, large-sample data. Such methodologies assume ignorability (observation of all confounding variables) and positivity (observation of all treatment levels for every covariate value describing a set of units), assumptions problematic in the continuous treatment regime. Scalable sensitivity and uncertainty analyses to understand the ignorance induced in causal estimates when these assumptions are relaxed are less studied. Here, we develop a continuous treatment-effect marginal sensitivity model (CMSM) and derive bounds that agree with the observed data and a researcher-defined level of hidden confounding. We introduce a scalable algorithm and uncertainty-aware deep models to derive and estimate these bounds for high-dimensional, large-sample observational data. We work in concert with climate scientists interested in the climatological impacts of human emissions on cloud properties using satellite observations from the past 15 years. This problem is known to be complicated by many unobserved confounders.
翻译:最近的工作重点是设计神经网络架构和正规化功能,以便从高维、大抽样数据中对平均和个别剂量反应曲线进行可缩放的估计。这些方法假定可忽略(观察所有令人困惑的变量)和现实性(观察每个共变值的所有处理水平,描述一套单位),假设在连续治疗制度中存在问题。在这些假设放松时,为了解因果估计引起的无知而进行可缩放的敏感度和不确定性分析研究较少。在这里,我们开发了一个持续治疗效应边缘敏感度模型(CMSM),并获得与所观测的数据一致的界限,以及一个与研究人员确定的隐蔽共解程度一致的界限。我们采用了一种可缩放算法和有不确定性的深层模型,以得出和估计高维度、大型观测数据的界限。我们与对人类排放对云层特性的气候学影响有兴趣的气候科学家们一起工作,通过从过去15年的卫星观测了解到的复杂问题。