In this work, a novel elastic time distance for sparse multivariate functional data is proposed. Subsequently, a robust distance-based two-layer partition clustering is introduced. With the proposed distance, our approach not only can detect correct clusters for sparse multivariate functional data under outlier settings but also can detect those outliers that do not belong to any clusters. The classical distance-based clustering methods such as density-based spatial clustering of applications with noise (DBSCAN), agglomerative hierarchical clustering, and K-medoids are extended to the sparse multivariate functional case based on our proposed distance. Numerical experiments on the simulated data highlight that the performance of the proposed algorithm is superior to the performances of the existing model-based and extended distance-based methods. Using Northwest Pacific cyclone track data as an example, we demonstrate the effectiveness of the proposed approach. The code is available online for readers to apply our clustering method and replicate our analyses.
翻译:在这项工作中,提出了稀有多变量功能数据的新颖弹性时间距离。 随后,引入了强固的基于远程的双层分割组合组合。在提议的距离下,我们的方法不仅能够探测出外部环境中稀有多变量功能数据的正确组群,而且能够探测出不属于任何组群的离子体。传统的基于远程的群集方法,如以密度为基础的以空间为基础对有噪音的应用(DBSCAN)、聚合性等级集群和K-米多型生物群落进行基于我们提议的距离的稀薄多变量功能案例。模拟数据的量化实验突出表明,拟议算法的性能优于现有基于模型的扩展远程方法的性能。以西北太平洋气旋轨道数据为例,我们展示了拟议方法的有效性。该代码可供读者在线应用我们的组合法并复制我们的分析。