In this paper, we set up the theoretical foundations for a high-dimensional functional factor model approach in the analysis of large cross-sections (panels) of functional time series (FTS). We first establish a representation result stating that, under mild assumptions on the covariance operator of the cross-section, we can represent each FTS as the sum of a common component driven by scalar factors loaded via functional loadings, and a mildly cross-correlated idiosyncratic component. Our model and theory are developed in a general Hilbert space setting that allows for mixed panels of functional and scalar time series. We then turn to the identification of the number of factors, and the estimation of the factors, their loadings, and the common components. We provide a family of information criteria for identifying the number of factors, and prove their consistency. We provide average error bounds for the estimators of the factors, loadings, and common component; our results encompass the scalar case, for which they reproduce and extend, under weaker conditions, well-established similar results. Under slightly stronger assumptions, we also provide uniform bounds for the estimators of factors, loadings, and common component, thus extending existing scalar results. Our consistency results in the asymptotic regime where the number $N$ of series and the number $T$ of time observations diverge thus extend to the functional context the "blessing of dimensionality" that explains the success of factor models in the analysis of high-dimensional (scalar) time series. We provide numerical illustrations that corroborate the convergence rates predicted by the theory, and provide finer understanding of the interplay between $N$ and $T$ for estimation purposes. We conclude with an application to forecasting mortality curves, where we demonstrate that our approach outperforms existing methods.
翻译:在本文中,我们在分析功能时间序列(FTS)的大型剖面(剖面)时,为高层次功能要素模型方法建立了理论基础。我们首先在分析功能时间序列(FTS)的大型截面(剖面)时,首先设定了一个表达结果,指出在对截面的共变量操作者略微假设下,我们可以将每个FTS作为由通过功能负荷加载的卡路里因素驱动的共同组成部分的总和,以及一个轻微的跨度相关特质组成部分。我们的模型和理论是在一个总的Hilbert空间环境中开发的,它允许功能和卡路时间序列的混合面板。然后我们转而来解释各种因素的数量,以及各种因素、其负荷和共同组成部分的估算。我们提供了一系列信息标准,用以确定各种因素的数量,并证明它们的一致性。我们为各种因素、负荷和共同组成部分的估算者提供了平均误差界限;我们的结果包括了卡路面数据案例,在较弱的条件下,它们复制和扩展了各种方法。在稍为较弱的假设下,我们还提供了对数值的精确的数值序列观测结果进行了解释,从而得出了数值序列结果,从而将数值序列结果的缩缩缩缩的数值结果。