A new model-based procedure is developed for sparse clustering of functional data that aims to classify a sample of curves into homogeneous groups while jointly detecting the most informative portions of domain. The proposed method is referred to as sparse and smooth functional clustering (SaS-Funclust) and relies on a general functional Gaussian mixture model whose parameters are estimated by maximizing a log-likelihood function penalized with a functional adaptive pairwise penalty and a roughness penalty. The former allows identifying the noninformative portion of domain by shrinking the means of separated clusters to some common values, whereas the latter improves the interpretability by imposing some degree of smoothing to the estimated cluster means. The model is estimated via an expectation-conditional maximization algorithm paired with a cross-validation procedure. Through a Monte Carlo simulation study, the SaS-Funclust method is shown to outperform other methods already appeared in the literature, both in terms of clustering performance and interpretability. Finally, three real-data examples are presented to demonstrate the favourable performance of the proposed method. The SaS-Funclust method is implemented in the $\textsf{R}$ package $\textsf{sasfunclust}$, available online at https://github.com/unina-sfere/sasfunclust.
翻译:开发了新的基于模型的程序,将功能性数据分散地分组,目的是将曲线样本分类为单一组,同时共同发现最丰富的域块。建议的方法称为分散和平稳的功能组群(SaS-Funclust),并依赖于一般功能性高斯混合模型,该模型的参数通过最大限度地增加对功能性适应性惩罚和粗糙处罚的对数值值功能性功能性功能来估算。前者允许通过将分离组群的手段压缩为某些共同值来识别域的非信息性部分,而后者则通过对估计组群手段施加一定程度的平滑来改进可解释性。该模型通过期待性条件最大化算法和交叉校验程序来估算。通过蒙特卡洛模拟研究,SaS-Funclust方法在组合性表现和可解释性两方面都显示超过文献中已经出现的其他方法。最后,提供了三个真实数据实例,以显示拟议方法的有利性能。SAS-Funclustt 方法在$\ textrus@Rfsmexus/ $unflias/ estresmus@emflias@emexus@https@ romexus@mus@ sups/ subs@ subs@ems@emexlixxxxxx@