普遍化的无限因素化模式 (Generalized infinite factorization models)

Factorization models express a statistical object of interest in terms of a collection of simpler objects. For example, a matrix or tensor can be expressed as a sum of rank-one components. However, in practice, it can be challenging to infer the relative impact of the different components as well as the number of components. A popular idea is to include infinitely many components having impact decreasing with the component index. This article is motivated by two limitations of existing methods: (1) lack of careful consideration of the within component sparsity structure; and (2) no accommodation for grouped variables and other non-exchangeable structures. We propose a general class of infinite factorization models that address these limitations. Theoretical support is provided, practical gains are shown in simulation studies, and an ecology application focusing on modelling bird species occurrence is discussed.

翻译：集成化模型表示对收集简单对象感兴趣的统计目标,例如,矩阵或强项可以以一等成分的总和表示,但在实践中,推断不同组成部分的相对影响以及组成部分的数目可能具有挑战性。流行的想法是包括影响随着组成指数下降的无限许多组成部分。这一条的动机是现有方法的两种局限性:(1) 缺乏对组成部分散居结构的仔细考虑;(2) 无法容纳组合变量和其他不可交换结构。我们提出了处理这些局限性的无限因子化模型的一般类别。提供了理论支持,模拟研究中显示了实际收益,并讨论了侧重于模拟鸟类物种发生情况的生态应用。