Multivariate Gaussian regression is embedded into a general distributional regression framework using flexible additive predictors determining all distributional parameters. While this is relatively straightforward for the means of the multivariate dependent variable, it is more challenging for the full covariance matrix {\Sigma} due to two main difficulties: (i) ensuring positive-definiteness of {\Sigma} and (ii) regularizing the high model complexity. Both challenges are addressed by adopting a parameterization of {\Sigma} based on its basic or modified Cholesky decomposition, respectively. Unlike the decomposition into variances and a correlation matrix, the Cholesky decomposition guarantees positive-definiteness for any predictor values regardless of the distributional dimension. Thus, this enables linking all distributional parameters to flexible predictors without any joint constraints that would substantially complicate other parameterizations. Moreover, this approach enables regularization of the flexible additive predictors through penalized maximum likelihood or Bayesian estimation as for other distributional regression models. Finally, the Cholesky decomposition allows to reduce the number of parameters when the components of the multivariate dependent variable have a natural order (typically time) and a maximum lag can be assumed for the dependencies among the components.
翻译:多种变量回归嵌入一个通用分布回归框架,使用灵活的添加性预测器确定所有分布参数。对于多变量依赖变量变量的手段来说,这相对简单,但对于完整的共变矩阵矩阵 {Sigma} 更具挑战性,因为两个主要困难:(一) 确保正-确定 {Sigma} 和(二) 使高模型复杂性正规化。这两个挑战都通过基于基本或修改后的 CHolesky 分解的参数化来解决。与分解成差异和关联矩阵不同,Choolesky分解保证任何预测值的正-确定性,而不论分布层面如何。因此,这能够将所有分配参数与灵活的预测值联系起来,而没有任何联合制约,使其他参数的参数严重复杂化。此外,这一方法使得灵活的添加预测器能够通过最可能受抑制的可能性或巴伊斯估计与其他分布回归模型一样,实现正规化。最后,当多变量依赖性变量各组成部分具有假设的自然顺序时,Cho洛斯基分使参数数量减少。