In a low-dimensional linear regression setup, considering linear transformations/combinations of predictors does not alter predictions. However, when the forecasting technology either uses shrinkage or is nonlinear, it does. This is precisely the fabric of the machine learning (ML) macroeconomic forecasting environment. Pre-processing of the data translates to an alteration of the regularization -- explicit or implicit -- embedded in ML algorithms. We review old transformations and propose new ones, then empirically evaluate their merits in a substantial pseudo-out-sample exercise. It is found that traditional factors should almost always be included as predictors and moving average rotations of the data can provide important gains for various forecasting targets. Also, we note that while predicting directly the average growth rate is equivalent to averaging separate horizon forecasts when using OLS-based techniques, the latter can substantially improve on the former when regularization and/or nonparametric nonlinearities are involved.
翻译:在低维线性回归装置中,考虑到预测器的线性变换/组合并不改变预测,然而,当预报技术使用缩缩或非线性时,它的确如此。这恰恰是机器学习(ML)宏观经济预测环境的结构。数据的预处理转化成对ML算法中嵌入的正规化 -- -- 明示或隐含 -- -- 的改变。我们审查旧变换和提出新的变换,然后在大规模假出样练习中从经验上评价其优点。发现传统因素几乎总是作为预测器和数据移动平均旋转来列入,可为各种预测目标带来重要收益。我们还注意到,虽然直接预测平均增长率相当于使用以OLS为基础的技术时平均地平线预报,但是当涉及正规化和/或非参数非线性非线性时,后者可以大大改进前者。