功能数据因数增量平滑模型 (Factor-augmented Smoothing Model for Functional Data)

We propose modeling raw functional data as a mixture of a smooth function and a highdimensional factor component. The conventional approach to retrieving the smooth function from the raw data is through various smoothing techniques. However, the smoothing model is not adequate to recover the smooth curve or capture the data variation in some situations. These include cases where there is a large amount of measurement error, the smoothing basis functions are incorrectly identified, or the step jumps in the functional mean levels are neglected. To address these challenges, a factor-augmented smoothing model is proposed, and an iterative numerical estimation approach is implemented in practice. Including the factor model component in the proposed method solves the aforementioned problems since a few common factors often drive the variation that cannot be captured by the smoothing model. Asymptotic theorems are also established to demonstrate the effects of including factor structures on the smoothing results. Specifically, we show that the smoothing coefficients projected on the complement space of the factor loading matrix is asymptotically normal. As a byproduct of independent interest, an estimator for the population covariance matrix of the raw data is presented based on the proposed model. Extensive simulation studies illustrate that these factor adjustments are essential in improving estimation accuracy and avoiding the curse of dimensionality. The superiority of our model is also shown in modeling Canadian weather data and Australian temperature data.

翻译：我们提议将原始功能数据建模,作为平稳功能和高维要素组成部分的混合体。从原始数据恢复平稳功能的传统方法是通过各种平滑技术。然而,平滑模型不足以恢复平稳曲线或在某些情况下捕捉数据差异。其中包括测量错误数量大、平滑基础功能被错误地确定,或功能平均水平的跨步跳动被忽略等情况。为了应对这些挑战,提议了一个要素增强平滑模型,并在实践中采用迭代数字估计方法。包括拟议方法中的要素模型组成部分解决了上述问题,因为有几个共同因素往往驱动着平滑模型无法捕捉的变异。还建立了“静态理论”以证明包含要素结构对平滑结果的影响。具体地说,我们表明,在要素装载矩阵补充空间上预测的平滑系数不那么正常。作为独立兴趣的副产品,一个人口变异性模型的数值估计方法解决了上述问题,因为拟议方法中的元素模型模型组成部分往往会促成一些共同因素导致平滑模型无法捕捉到的曲线曲线曲线曲线曲线曲线曲线。我们在模拟中展示了这些模型的精确性数据。在模拟中展示了我们所展示的精确性的数据的精确性。在模拟中展示了这些模型中的精确性。