In implementations of the functional data methods, the effect of the initial choice of an orthonormal basis has not gained much attention in the past. Typically, several standard bases such as Fourier, wavelets, splines, etc. are considered to transform observed functional data and a choice is made without any formal criteria indicating which of the bases is preferable for the initial transformation of the data into functions. In an attempt to address this issue, we propose a strictly data-driven method of orthogonal basis selection. The method uses recently introduced orthogonal spline bases called the splinets obtained by efficient orthogonalization of the B-splines. The algorithm learns from the data in the machine learning style to efficiently place knots. The optimality criterion is based on the average (per functional data point) mean square error and is utilized both in the learning algorithms and in comparison studies. The latter indicates efficiency that is particularly evident for the sparse functional data and to a lesser degree in analyses of responses to complex physical systems.
翻译:在执行功能数据方法时,最初选择正态基础的效果过去没有引起多少注意。通常,一些标准基础,如Fourier、波子、螺纹条等,被视为转换观察到的功能数据,作出选择时不采用任何正式标准,说明哪些基础更适合将数据初步转换为函数。为了解决这一问题,我们提议了严格由数据驱动的正方位选择方法。该方法最近采用的正方位样条基,称为通过高效或正方位化B-线获得的样板。算法从机器学习模式中的数据中学习,以便有效地结结结。最佳性标准以平均(每个功能数据点)为标准,在学习算法和比较研究中使用。后者指出效率,对于稀有功能数据来说尤为明显,在分析复杂的物理系统的反应方面,效率则较低。