This paper proposes a novel multivariate definition of statistical dependence using a functional methodology inspired by Alfred R\'enyi. We define a new symmetric and self-adjoint cross density kernel through a recursive bidirectional statistical mapping between conditional densities of continuous random processes, which estimates their statistical dependence. Therefore, the kernel eigenspectrum is proposed as a new multivariate statistical dependence measure, and the formulation requires fewer assumptions about the data generation model than current methods. The measure can also be estimated from realizations. The proposed functional maximum correlation algorithm (FMCA) is applied to a learning architecture with two multivariate neural networks. The FMCA optimal solution is an equilibrium point that estimates the eigenspectrum of the cross density kernel. Preliminary results with synthetic data and medium size image datasets corroborate the theory. Four different strategies of applying the cross density kernel are thoroughly discussed and implemented to show the versatility and stability of the methodology, and it transcends supervised learning. When two random processes are high-dimensional real-world images and white uniform noise, respectively, the algorithm learns a factorial code i.e., the occurrence of a code guarantees that a certain input in the training set was present, which is quite important for feature learning.
翻译:本文建议采用阿尔弗雷德·R\'enyi所启发的功能性方法,对统计依赖性进行新的多变量定义。我们通过连续随机过程的有条件密度(估计其统计依赖性)之间的双向双向统计绘图,定义一个新的对称和自我连接的交叉密度内核。因此,提议将内核对统计依赖性作为新的多变量统计依赖性计量标准,而该公式要求的数据生成模型假设也比目前方法少。该计量也可以从实现中估算。拟议功能最大关联算法(FMCA)适用于两个多变量神经网络的学习结构。FMCA最佳解决方案是一个平衡点,用来估计交叉密度内核的微光度。合成数据和中等大小图像数据集的初步结果证实了这一理论。应用跨密度内核的四种不同的战略都经过彻底讨论和实施,以显示方法的多功能性和稳定性,而且它超越了监督的学习范围。当两个随机程序是高维度真实世界图像和白色统一噪音时,两种随机算法是相当重要的计算法,其中的一种是当前学习一个重要要素的数学特性。