We propose a framework for decision-making in the presence of strategic agents with panel data, a standard setting in econometrics and statistics where one gets noisy, repeated measurements of multiple units. We consider a setup where there is a pre-intervention period, when the principal observes the outcomes of each unit, after which the principal uses these observations to assign treatment to each unit. Our model can be thought of as a generalization of the synthetic controls and synthetic interventions frameworks, where units (or agents) may strategically manipulate pre-intervention outcomes to receive a more desirable intervention. We identify necessary and sufficient conditions under which a strategyproof mechanism that assigns interventions in the post-intervention period exists. Under a latent factor model assumption, we show that whenever a strategyproof mechanism exists, there is one with a simple closed form. In the setting where there is a single treatment and control (i.e., no other interventions), we establish that there is always a strategyproof mechanism, and provide an algorithm for learning such a mechanism. For the setting of multiple interventions, we provide an algorithm for learning a strategyproof mechanism, if there exists a sufficiently large gap in rewards between the different interventions. Along the way, we prove impossibility results for multi-class strategic classification, which may be of independent interest.
翻译:我们建议了一个在战略代理机构有小组数据的情况下进行决策的框架,一个计量经济和统计的标准设置,在这种情况下,人们会感到吵闹,反复测量多个单位。我们考虑一个设置,在干预前有一段时间,主要观察每个单位的结果,然后主要利用这些观察来分配每个单位的治疗。我们的模式可以被视为合成控制和合成干预框架的概括,单位(或代理机构)可以在战略上操纵干预前的结果,以获得更可取的干预。我们确定一个必要和充分的条件,在这种条件下,存在一个在干预后时期分配干预措施的防战略机制。在潜伏因素模型假设下,我们表明,只要战略防范机制存在,就有一个简单的封闭机制。在采用单一治疗和控制(即没有其他干预)的环境下,我们确定始终有一个防战略约束机制,为学习这种机制提供算法。对于制定多种干预,我们提供了一种学习战略防范机制的算法,如果在不同的干预中存在足够大的奖赏差距,那么在战略分类上,我们可能证明是不可能的。