We consider the problem of constructing matched groups such that the resulting groups are statistically similar with respect to their average values for multiple covariates. This group-matching problem arises in many cases, including quasi-experimental and observational studies in which subjects or items are sampled from pre-existing groups, scenarios in which traditional pair-matching approaches may be inappropriate. We consider the case in which one is provided with an existing sample and iteratively eliminates samples so that the groups "match" according to arbitrary statistically-defined criteria. This problem is NP-hard. However, using artificial and real-world data sets, we show that heuristics implemented by the ldamatch package produce high-quality matches.
翻译:我们认为建立匹配组的问题,因此,由此形成的组群在统计上与多个共变体的平均值相似,在许多情况下出现这种组群匹配问题,包括从原已存在的组群中抽取主题或物品的准实验和观察研究,在这种研究中,传统的配对方法可能不适当,我们认为,提供现有样本并迭接地消除样本,以便按照任意统计界定的标准“匹配”的组群。这是NP硬的问题。然而,我们利用人工和现实世界的数据集,表明由阿尔达马奇套件执行的超常方法会产生高质量的匹配。