Random sampling of graph partitions under constraints has become a popular tool for evaluating legislative redistricting plans. Analysts detect partisan gerrymandering by comparing a proposed redistricting plan with an ensemble of sampled alternative plans. For successful application, sampling methods must scale to large maps with many districts, incorporate realistic legal constraints, and accurately and efficiently sample from a selected target distribution. Unfortunately, most existing methods struggle in at least one of these three areas. We present a new Sequential Monte Carlo (SMC) algorithm that draws representative redistricting plans from a realistic target distribution of choice. Because it samples directly, the SMC algorithm can efficiently explore the relevant space of redistricting plans better than the existing Markov chain Monte Carlo algorithms that yield dependent samples. Our algorithm can simultaneously incorporate several constraints commonly imposed in real-world redistricting problems, including equal population, compactness, and preservation of administrative boundaries. We validate the accuracy of the proposed algorithm by using a small map where all redistricting plans can be enumerated. We then apply the SMC algorithm to evaluate the partisan implications of several maps submitted by relevant parties in a recent high-profile redistricting case in the state of Pennsylvania. We find that the proposed algorithm is roughly 40 times more efficient in sampling from the target distribution than a state-of-the-art MCMC algorithm. Open-source software is available for implementing the proposed methodology.
翻译:分析员通过将拟议的重新划分计划与一系列抽样替代计划进行比较,发现党派偏差。为了成功应用,抽样方法必须扩大到许多地区的大地图,纳入现实的法律限制,并准确和高效地从选定的目标分布中抽取抽样。不幸的是,大多数现有方法至少在这三个地区中的一个地区挣扎。我们提出了一个新的序列式蒙特卡洛(SMC)算法,从现实的目标分布中提取代表重新划分计划。由于SMC算法直接进行抽样,因此,SMC算法可以有效地探索重新划分计划的相关空间,而比现有的马可夫链蒙特卡洛算法更能产生依赖的样本。我们的算法可以同时纳入现实世界重新划分问题中常见的一些限制,包括人口平等、紧凑性以及保护行政边界。我们用一个小的地图来验证拟议的算法的准确性,所有重新划分计划都可以在其中进行分类。我们随后运用SMC算法来评估有关各方提交的若干地图的贴切性影响,因为SMC直接进行抽样,因此可以比现有的马可依赖的蒙特卡洛运算法更好地探讨。我们从一个高比例分析系统中找到了一个在40州里,我们从一个拟议的标准中找到了一个可选择的公开分析方法。