Curve matching is a prediction technique that relies on predictive mean matching, which matches donors that are most similar to a target based on the predictive distance. Even though this approach leads to high prediction accuracy, the predictive distance may make matches look unconvincing, as the profiles of the matched donors can substantially differ from the profile of the target. To counterbalance this, similarity between the curves of the donors and the target can be taken into account by combining the predictive distance with the Mahalanobis distance into a `blended distance' measure. The properties of this measure are evaluated in two simulation studies. Simulation study I evaluates the performance of the blended distance under different data-generating conditions. The results show that blending towards the Mahalanobis distance leads to worse performance in terms of bias, coverage, and predictive power. Simulation study II evaluates the blended metric in a setting where a single value is imputed. The results show that a property of blending is the bias-variance trade off. Giving more weight to the Mahalanobis distance leads to less variance in the imputations, but less accuracy as well. The main conclusion is that the high prediction accuracy achieved with the predictive distance necessitates the variability in the profiles of donors.
翻译:曲线匹配是一种预测技术,它依赖于预测平均比对,与预测距离目标最相似的捐赠者匹配。虽然这种方法导致预测准确性很高,但预测距离可能使匹配看起来不令人信服,因为匹配捐赠者的情况可能与目标的轮廓大不相同。为了抵消这一影响,可以通过将预测距离与马哈拉诺比斯距离的曲线和目标之间的相似性结合到“混合距离”的测量中来加以考虑。在两个模拟研究中评估了这一措施的特性。模拟研究我评估了在不同数据生成条件下混合距离的性能。结果显示,与马哈拉诺比斯距离的混合在偏差、覆盖面和预测能力方面的性能可能大不相同。模拟研究二评估了单一价值的组合基准。结果显示,混合距离的特性是偏差交易。对马哈拉诺比斯距离的特性进行了两次模拟研究,使马哈拉诺比斯距离的重量增加,导致干扰减少,但在不同数据生成条件下混合距离的性能测得更低。主要结论是预测的准确性。主要结论是,与预测的准确性以及预测的准确性。