Multivariate time-dependent data, where multiple features are observed over time for a set of individuals, are increasingly widespread in many application domains. To model these data we need to account for relations among both time instants and variables and, at the same time, for subjects heterogeneity. We propose a new co-clustering methodology for clustering individuals and variables simultaneously that is designed to handle both functional and longitudinal data. Our approach borrows some concepts from the curve registration framework by embedding the Shape Invariant Model in the Latent Block Model, estimated via a suitable modification of the SEM-Gibbs algorithm. The resulting procedure allows for several user-defined specifications of the notion of cluster that could be chosen on substantive grounds and provides parsimonious summaries of complex longitudinal or functional data by partitioning data matrices into homogeneous blocks.
翻译:多年来对一组个人观察到多种特征的多变时间依赖数据在许多应用领域日益普及。为了模拟这些数据,我们需要对时间瞬和变数之间的关系以及同时对主题的不同性进行核算。我们提出了一个新的同时对个人和变量进行分组的共同集群方法,该方法旨在处理功能性和纵向数据。我们的方法从曲线登记框架中借用了一些概念,将形状变异模型嵌入“低端区块模型”中,通过适当修改SEM-Gibbs算法加以估计。由此形成的程序允许对可按实质性理由选择的组群概念进行若干用户定义的规格,并通过将数据矩阵分割成同质区块来提供复杂纵向或功能数据的模糊摘要。