The conclusions of randomized controlled trials may be biased when the outcome of one unit depends on the treatment status of other units, a problem known as interference. In this work, we study interference in the setting of one-sided bipartite experiments in which the experimental units - where treatments are randomized and outcomes are measured - do not interact directly. Instead, their interactions are mediated through their connections to interference units on the other side of the graph. Examples of this type of interference are common in marketplaces and two-sided platforms. The cluster-randomized design is a popular method to mitigate interference when the graph is known, but it has not been well-studied in the one-sided bipartite experiment setting. In this work, we formalize a natural model for interference in one-sided bipartite experiments using the exposure mapping framework. We first exhibit settings under which existing cluster-randomized designs fail to properly mitigate interference under this model. We then show that minimizing the bias of the difference-in-means estimator under our model results in a balanced partitioning clustering objective with a natural interpretation. We further prove that our design is minimax optimal over the class of linear potential outcomes models with bounded interference. We conclude by providing theoretical and experimental evidence of the robustness of our design to a variety of interference graphs and potential outcomes models.
翻译:当一个单位的结果取决于其他单位的处理状况时,随机控制试验的结论可能会有偏差,因为一个单位的结果取决于其他单位的处理状况,这是一个被称为干扰的问题。在这项工作中,我们研究对单面双方试验的设置的干扰,试验单位-处理是随机的,结果是测量的-不直接相互作用。相反,它们的互动通过它们与图另一侧的干扰单位的联系进行调解。这种干扰的例子在市场和双面平台中是常见的。集束随机设计是在图表已知时减少干扰的流行方法,但在片面的双方试验设置中没有很好地加以研究。在这项工作中,我们利用暴露绘图框架正式确定了单面双方试验干预的自然模式。我们首先展示了现有集束随机设计无法适当减轻干扰的环境。我们然后表明,在我们模型下,将差异估计的偏差的偏差的偏差程度降到了一种平衡的分隔组合目标,但在片面的双面试验环境中,我们进一步证明我们的设计设计是一小的干涉的理论模型,我们以最优的模型的形式展示了我们设计中最优的线性模型。