We put forward a simple new randomized missing data (RMD) approach to robust filtering of state-space models, motivated by the idea that the inclusion of only a small fraction of available highly precise measurements can still extract most of the attainable efficiency gains for filtering latent states, estimating model parameters, and producing out-of-sample forecasts. In our general RMD framework we develop two alternative implementations: endogenous (RMD-N) and exogenous (RMD-X) randomization of missing data. A degree of robustness to outliers and model misspecification is achieved by purposely randomizing over the utilized subset of data measurements in their original time series order, while treating the rest as if missing. The arising robustness-efficiency trade-off is controlled by varying the fraction of randomly utilized measurements. Our RMD framework thus relates to but is different from a wide range of machine learning methods trading off bias against variance. It also provides a time-series extension of bootstrap aggregation (bagging). As an empirical illustration, we show consistently attractive performance of RMD filtering and forecasting in popular state space models for extracting inflation trends known to be hindered by measurement outliers.
翻译:我们提出了一种简单、新的随机缺失数据(RMD)方法,以稳健地筛选国家空间模型,其动机是,仅仅纳入一小部分现有高度精确的测量数据,仍然能够提取大部分可实现的效率收益,用于过滤潜伏状态,估计模型参数,并作出抽样预测。在我们的总体RMD框架中,我们开发了两种不同的执行方法:本地(RMD-N)和外源(RMD-X)数据随机化。对外源和模型定型的强度,通过在原始时间序列顺序中对已使用的数据计量子集进行故意随机随机抽查,而将其余的则视其为缺失,从而达到一定程度的强度和模型定型误差,同时将其余作为缺失处理。正在形成的稳健性效率交易是通过随机利用测量的分数加以控制的。因此,我们的RMD框架与广泛的机器学习方法有区别,而不同的是用偏差来交换偏差。它也提供了靴带汇总的时间序列扩展(缩缩)。作为经验性说明,我们展示了RMD过滤和预测流行空间模型的一贯有吸引力的性表现,以得出已知因测量而受到阻碍的通货膨胀趋势。