We propose a framework for decision-making in the presence of strategic agents with panel data, a standard setting in econometrics and statistics where one gets noisy, repeated measurements of multiple units. We consider a setup where there is a pre-intervention period, when the principal observes the outcomes of each unit, after which the principal uses these observations to assign a treatment to each unit. Our model can be thought of as a generalization of the synthetic controls and synthetic interventions frameworks, where units (or agents) may strategically manipulate pre-intervention outcomes to receive a more desirable intervention. We identify necessary and sufficient conditions under which a strategyproof mechanism that assigns interventions in the post-intervention period exists. Under a latent factor model assumption, we show that whenever a strategyproof mechanism exists, there is one with a simple closed form. In the setting where there is a single treatment and control (i.e., no other interventions), we establish that there is always a strategyproof mechanism, and provide an algorithm for learning such a mechanism. For the setting of multiple interventions, we provide an algorithm for learning a strategyproof mechanism, if there exists a sufficiently large gap in rewards between the different interventions. Finally, we empirically evaluate our model using real-world panel data collected from product sales over 18 months. We find that our methods compare favorably to baselines which do not take strategic interactions into consideration -- even in the presence of model misspecification. Along the way, we prove impossibility results for multi-class strategic classification, which may be of independent interest.
翻译:我们提出一个框架,以便在战略代理机构具备小组数据的情况下进行决策;在计量经济学和统计的标准设置方面,人们会感到吵闹,反复测量多个单位;我们考虑在存在干预前阶段时,由主观察每个单位的结果,然后由主观察对每个单位进行处理;我们的模式可以被视为综合控制和综合干预框架的概括,单位(或代理机构)可在战略上操纵干预前结果,以获得更可取的干预;我们确定必要和充分的条件,在这种条件下,存在一个在干预后时期分配干预措施的防战略机制;在潜在因素模型假设下,我们表明,每当存在战略防范机制时,就有一个简单的封闭机制;在这样的情况下,主要观察意见用于对每个单位进行单一的处理和控制(即,没有其他干预),我们确定我们的模式总是有防战略约束性的机制,为学习这种机制提供算法。对于制定多种干预,我们提供了一种用于学习战略防范机制的算法,如果在不同的干预期间存在相当大的奖励差距,那么在不同的干预期间,我们就会存在一种战略保密机制;在潜在的要素模型存在时,我们就会表明,只要存在一种简单的封闭式机制;最后,我们用一个单一的模型来评估我们用18个模型来进行实验性的数据考虑。