We consider the task of evaluating policies of algorithmic resource allocation through randomized controlled trials (RCTs). Such policies are tasked with optimizing the utilization of limited intervention resources, with the goal of maximizing the benefits derived. Evaluation of such allocation policies through RCTs proves difficult, notwithstanding the scale of the trial, because the individuals' outcomes are inextricably interlinked through resource constraints controlling the policy decisions. Our key contribution is to present a new estimator leveraging our proposed novel concept, that involves retrospective reshuffling of participants across experimental arms at the end of an RCT. We identify conditions under which such reassignments are permissible and can be leveraged to construct counterfactual trials, whose outcomes can be accurately ascertained, for free. We prove theoretically that such an estimator is more accurate than common estimators based on sample means -- we show that it returns an unbiased estimate and simultaneously reduces variance. We demonstrate the value of our approach through empirical experiments on synthetic, semi-synthetic as well as real case study data and show improved estimation accuracy across the board.
翻译:我们考虑通过随机控制试验来评估算法资源分配政策的任务。这种政策的任务是优化有限干预资源的利用,以尽量扩大所产生的效益。尽管试验的规模很大,但通过RCT来评估这种分配政策证明是困难的,因为通过控制政策决定的资源限制,个人的结果是密不可分的。我们的主要贡献是提出一个新的估计者,利用我们提议的新概念,包括在RCT结束时对试验武器参与者进行回溯性调整。我们确定允许和可以利用这种重新分配的条件,以便建立反事实试验,其结果可以准确确定,免费。我们从理论上证明,这种估计者比基于抽样手段的普通估计者更准确 -- -- 我们表明,它可以得出不偏颇的估计,同时减少差异。我们通过对合成、半合成和真实的案例研究数据进行实验实验,来证明我们的方法的价值,并显示全局的估算准确性得到改善。