Considering the context of functional data analysis, we developed and applied a new Bayesian approach via Gibbs sampler to select basis functions for a finite representation of functional data. The proposed methodology uses Bernoulli latent variables to assign zero to some of the basis function coefficients with a positive probability. This procedure allows for an adaptive basis selection since it can determine the number of bases and which should be selected to represent functional data. Moreover, the proposed procedure measures the uncertainty of the selection process and can be applied to multiple curves simultaneously. The methodology developed can deal with observed curves that may differ due to experimental error and random individual differences between subjects, which one can observe in a real dataset application involving daily numbers of COVID-19 cases in Brazil. Simulation studies show the main properties of the proposed method, such as its accuracy in estimating the coefficients and the strength of the procedure to find the true set of basis functions. Despite having been developed in the context of functional data analysis, we also compared the proposed model via simulation with the well-established LASSO and Bayesian LASSO, which are methods developed for non-functional data.
翻译:考虑到功能数据分析的背景,我们通过Gibbs采样器开发并应用了一种新的巴伊西亚方法,为功能数据的有限代表性选择基础功能。拟议方法使用Bernoulli 潜伏变量为某些基础函数系数分配零,概率为正数。这一程序允许采用适应性基础选择,因为它可以确定基数的数量,并且应当选择哪些基础来代表功能数据。此外,拟议程序测量了选择过程的不确定性,可以同时适用于多个曲线。所开发的方法可以处理观察到的曲线,这些曲线可能因实验错误和不同对象之间的随机个别差异而有所不同,这可以在涉及巴西COVID-19日数的真实数据集应用中观察到。模拟研究显示了拟议方法的主要特性,例如其在估计系数方面的准确性以及找到真正基础功能集的程序的力度。尽管是在功能数据分析的背景下开发的,但我们也通过模拟了拟议的模型,与为非功能数据开发的方法而建立的LASSO和Bayesian LASSO进行了比较。