A new algorithm is proposed for clustering longitudinal curves. The mean curves and the principal component functions are modeled using B-spline. The clusters of the mean curves are identified through a concave pairwise fusion method. The EM algorithm and the alternating direction method of multiplier algorithm are combined to estimate the group structure, mean functions and the principal components simultaneously. The proposed method also allows to incorporate the prior neighborhood information to have more meaningful groups by adding pairwise weights in the pairwise penalties. In the simulation study, the performance of the proposed method is compared to two existing clustering methods in terms of the accuracy for estimating the number of subgroups and mean functions. The results suggest that ignoring covariance structure will have a great effect on the performance of estimating the number of groups and estimating accuracy. The effect of including pairwise weights is also explored in a spatial lattice setting to take consideration of the spatial information. The results show that incorporating spatial weights will improve the performance. An example is used to illustrate the algorithm.
翻译:为长纵向曲线群集提议了新的算法。 平均曲线和主要组成部分功能采用B- spline 模型化。 平均曲线的组群通过对对齐组合法确定。 EM算法和乘数算法的交替方向法结合在一起, 以同时估计组群结构、 平均函数和主要组成部分。 拟议的方法还允许将以前的邻里信息纳入更有意义的组群, 在对对齐罚款中增加双对比重。 在模拟研究中, 将拟议方法的性能比作两种现有的组群方法, 以估计分组和中值函数数目的准确性为准。 结果表明, 忽略共变结构将对估计组群数和估计精度的性能产生很大影响。 将配对比权重包括在内的效果也在空间阵列环境中进行探讨, 以考虑到空间信息。 结果表明, 将空间权重纳入将改善性能。 举例来说明算法。