Nonparametric and machine learning methods are flexible methods for obtaining accurate predictions. Nowadays, data sets with a large number of predictors and complex structures are fairly common. In the presence of item nonresponse, nonparametric and machine learning procedures may thus provide a useful alternative to traditional imputation procedures for deriving a set of imputed values. In this paper, we conduct an extensive empirical investigation that compares a number of imputation procedures in terms of bias and efficiency in a wide variety of settings, including high-dimensional data sets. The results suggest that a number of machine learning procedures perform very well in terms of bias and efficiency.
翻译:非参数和机器学习方法是获得准确预测的灵活方法。如今,拥有大量预测器和复杂结构的数据集相当普遍,因此,在出现不答复项目的情况下,非参数和机器学习程序可能为得出一套估算值的传统估算程序提供有用的替代方法。在本文件中,我们进行了广泛的实证调查,从包括高维数据集在内的各种环境中的偏差和效率的角度比较了一些估算程序。结果显示,一些机器学习程序在偏差和效率方面表现良好。