在一个连接单位的世界中序列因果推论 (Sequential causal inference in a single world of connected units)

We consider adaptive designs for a trial involving N individuals that we follow along T time steps. We allow for the variables of one individual to depend on its past and on the past of other individuals. Our goal is to learn a mean outcome, averaged across the N individuals, that we would observe, if we started from some given initial state, and we carried out a given sequence of counterfactual interventions for $\tau$ time steps. We show how to identify a statistical parameter that equals this mean counterfactual outcome, and how to perform inference for this parameter, while adaptively learning an oracle design defined as a parameter of the true data generating distribution. Oracle designs of interest include the design that maximizes the efficiency for a statistical parameter of interest, or designs that mix the optimal treatment rule with a certain exploration distribution. We also show how to design adaptive stopping rules for sequential hypothesis testing. This setting presents unique technical challenges. Unlike in usual statistical settings where the data consists of several independent observations, here, due to network and temporal dependence, the data reduces to one single observation with dependent components. In particular, this precludes the use of sample splitting techniques. We therefore had to develop a new equicontinuity result and guarantees for estimators fitted on dependent data. We were motivated to work on this problem by the following two questions. (1) In the context of a sequential adaptive trial with K treatment arms, how to design a procedure to identify in as few rounds as possible the treatment arm with best final outcome? (2) In the context of sequential randomized disease testing at the scale of a city, how to estimate and infer the value of an optimal testing and isolation strategy?

翻译：我们考虑对N个人进行试验的适应性设计,我们沿T时间步骤跟踪。我们允许一个人的变数取决于其过去和其他个人的过去。我们的目标是学习一个平均在N个人中平均的平均值结果,如果我们从某个给定的初始状态开始,我们就会观察,如果我们从某个给定的初始状态开始,我们执行了一个反事实干预的一定顺序,以美元计时间步骤。我们展示了如何确定一个统计参数,这个参数等于反事实结果,以及如何对这一参数进行推断,同时适应性地学习一个骨骼设计,将其定义为真正数据生成分布的参数。我们的目标在于了解一个平均结果,即尽可能提高利益统计参数的效率,或将最佳治疗规则与某种勘探分布结合起来。我们还展示了如何设计适应性规则,以按顺序进行测测测,这带来了独特的技术挑战。在通常的统计环境中,数据由若干独立的观察组成,由于网络和时间依赖性,数据被降为单一的观察,特别是,这排除了在精确处理的尺度上使用一个参数的参数的参数。奥氏度评估,从而无法在设计一个精度上使用一个精度分比技术。我们测的顺序测试,因此,我们不得不将一个测试了一个测试了这个结果。我们用一个测试到一个有动力的顺序的顺序的顺序测测测测测测测测测测测测测。我们用了一种新的结果。我们测测测测测测测测算。