Modern data science applications often involve complex relational data with dynamic structures. An abrupt change in such dynamic relational data is typically observed in systems that undergo regime changes due to interventions. In such a case, we consider a factorized fusion shrinkage model in which all decomposed factors are dynamically shrunk towards group-wise fusion structures, where the shrinkage is obtained by applying global-local shrinkage priors to the successive differences of the row vectors of the factorized matrices. The proposed priors enjoy many favorable properties in comparison and clustering of the estimated dynamic latent factors. Comparing estimated latent factors involves both adjacent and long-term comparisons, with the time range of comparison considered as a variable. Under certain conditions, we demonstrate that the posterior distribution attains the minimax optimal rate up to logarithmic factors. In terms of computation, we present a structured mean-field variational inference framework that balances optimal posterior inference with computational scalability, exploiting both the dependence among components and across time. The framework can accommodate a wide variety of models, including dynamic matrix factorization, latent space models for networks and low-rank tensors. The effectiveness of our methodology is demonstrated through extensive simulations and real-world data analysis.
翻译:动态关系数据的因式融合收缩模型
现代数据科学应用经常涉及具有动态结构的复杂关系数据。在经历干预的系统中,通常会观察到动态关系数据的突然变化。在这种情况下,我们考虑一个因式融合收缩模型,其中所有分解因子都被动态收缩到群体融合结构,其中通过对分解矩阵的行向量的连续差值施加全局局部收缩先验来获得收缩。所提出的先验在估计的动态潜在因子的比较和聚类方面具有许多有利的特性。比较估计的潜在因子涉及到邻近的和长期的比较,考虑到比较的时间范围作为变量。在某些条件下,我们证明了后验分布达到了迄今为止的最小最大率(最优比率)直到对数因子。在计算方面,我们提出了一种结构化的均场变分推理框架,平衡了最优后验推理和计算可扩展性,利用了组件和时间之间的依赖关系。该框架可以容纳各种模型,包括动态矩阵因子分解、网络潜空间模型和低秩张量。我们通过广泛的模拟和实际数据分析展示了我们方法的有效性。