Sufficient dimension reduction is a powerful tool to extract core information hidden in the high-dimensional data and has potentially many important applications in machine learning tasks. However, the existing nonlinear sufficient dimension reduction methods often lack the scalability necessary for dealing with large-scale data. We propose a new type of stochastic neural network under a rigorous probabilistic framework and show that it can be used for sufficient dimension reduction for large-scale data. The proposed stochastic neural network is trained using an adaptive stochastic gradient Markov chain Monte Carlo algorithm, whose convergence is rigorously studied in the paper as well. Through extensive experiments on real-world classification and regression problems, we show that the proposed method compares favorably with the existing state-of-the-art sufficient dimension reduction methods and is computationally more efficient for large-scale data.
翻译:足够维度的减少是提取高维数据中隐藏的核心信息的有力工具,在机器学习任务中可能有许多重要的应用。然而,现有的非线性足够维度的减少方法往往缺乏处理大规模数据所需的伸缩性。我们提议在严格概率框架之下建立一个新型的随机神经网络,并表明它可用于为大规模数据提供足够的伸缩度。提议的随机神经网络使用适应性随机梯度梯度马尔科夫链 Monte Carlo算法进行培训,该算法的趋同也在文件中进行严格研究。我们通过对现实世界分类和回归问题的广泛试验,表明拟议方法与现有最先进的足够维度减少方法相比,在计算上对大规模数据更有效。