选择巴耶斯等级的半参数功能系数模型 (Semiparametric Functional Factor Models with Bayesian Rank Selection)

Functional data are frequently accompanied by parametric templates that describe the typical shapes of the functions. Although the templates incorporate critical domain knowledge, parametric functional data models can incur significant bias, which undermines the usefulness and interpretability of these models. To correct for model misspecification, we augment the parametric templates with an infinite-dimensional nonparametric functional basis. Crucially, the nonparametric factors are regularized with an ordered spike-and-slab prior, which implicitly provides rank selection and satisfies several appealing theoretical properties. This prior is accompanied by a parameter-expansion scheme customized to boost MCMC efficiency, and is broadly applicable for Bayesian factor models. The nonparametric basis functions are learned from the data, yet constrained to be orthogonal to the parametric template in order to preserve distinctness between the parametric and nonparametric terms. The versatility of the proposed approach is illustrated through applications to synthetic data, human motor control data, and dynamic yield curve data. Relative to parametric alternatives, the proposed semiparametric functional factor model eliminates bias, reduces excessive posterior and predictive uncertainty, and provides reliable inference on the effective number of nonparametric terms--all with minimal additional computational costs.

翻译：功能数据往往配有描述功能的典型形状的参数模板。虽然模板包含关键领域知识,但参数功能数据模型可能产生重大偏差,损害这些模型的有用性和可解释性。为了纠正模型的偏差,我们用无限的、非参数功能基础来补充参数模板。关键的是,非参数因素与事先定序的钉钉钉和滑块相正规化,这间接地提供了等级选择,并满足了若干有吸引力的理论属性。此前,还伴有一种为提升MCMCMC效率而定制的参数扩展功能模型,并广泛适用于Bayesian系数模型。从数据中学习了非参数基础功能,但受限制以垂直的方式与参数模板相适应,以保持参数参数和非参数的区别性。拟议方法的多变性通过合成数据应用、人类运动控制数据和动态收益曲线数据加以说明。相对于参数替代方法,拟议的半参数功能模型可以消除偏差,减少后方和预测不确定性,并广泛适用于Bayesia模型。非参数的可靠性估算成本为最低。