We propose a multivariate probability distribution that models a linear correlation between binary and continuous variables. The proposed distribution is a natural extension of the previously developed multivariate binary distribution based on Grassmann numbers. As an application of the proposed distribution, we develop a factor analysis for a mixture of continuous and binary variables. We also discuss improper solutions associated with maximum likelihood estimation of factor analysis. As a prescription to avoid improper solutions, we impose a constraint that the norms of each vector of a factor loading matrix are the same. We numerically validated the proposed factor analysis and norm constraint prescription by analyzing real datasets.
翻译:我们提出多变量概率分布,以模拟二进制变量和连续变量之间的线性关联。提议的分布是先前开发的多变量二进制分布的自然延伸,以格拉斯曼数字为基础。作为拟议分布的一种应用,我们为连续变量和二进制变量的混合进行系数分析。我们还讨论与最大可能估计系数分析相关的不适当解决办法。作为避免不正确解决办法的处方,我们限制要素装载矩阵的每个矢量的规范相同。我们通过分析真实数据集,对拟议要素分析和规范约束处方进行了量化验证。