An old problem in multivariate statistics is that linear Gaussian models are often unidentifiable, i.e. some parameters cannot be uniquely estimated. In factor analysis, an orthogonal rotation of the factors is unidentifiable, while in linear regression, the direction of effect cannot be identified. For such linear models, non-Gaussianity of the (latent) variables has been shown to provide identifiability. In the case of factor analysis, this leads to independent component analysis, while in the case of the direction of effect, non-Gaussian versions of structural equation modelling solve the problem. More recently, we have shown how even general nonparametric nonlinear versions of such models can be estimated. Non-Gaussianity is not enough in this case, but assuming we have time series, or that the distributions are suitably modulated by some observed auxiliary variables, the models are identifiable. This paper reviews the identifiability theory for the linear and nonlinear cases, considering both factor analytic models and structural equation models.
翻译:在多变量统计中,一个老的问题是线性高斯模型往往无法辨别,即某些参数无法单独估计。在要素分析中,参数的正方旋转是无法辨别的,而在线性回归中,效力的方向则无法辨别。对于这种线性模型来说,(老的)变量的非古西尼特(古西尼特)变量已证明提供了可辨识性。在要素分析中,这导致独立的组成部分分析,而在效果方向方面,结构方程模型的非古西文版本解决了问题。最近,我们甚至展示了这些模型的普通非参数非线性非线性版本如何可以估计。在本案中,非古西尼特还不够,但假设我们有时间序列,或者分布由观察到的一些辅助变量适当调整,这些模型是可以辨认的。本文审查了线性和非线性案例的可辨性理论,同时考虑到要素分析模型和结构方程模型。