Restricted Boltzmann Machines (RBMs) offer a versatile architecture for unsupervised machine learning that can in principle approximate any target probability distribution with arbitrary accuracy. However, the RBM model is usually not directly accessible due to its computational complexity, and Markov-chain sampling is invoked to analyze the learned probability distribution. For training and eventual applications, it is thus desirable to have a sampler that is both accurate and efficient. We highlight that these two goals generally compete with each other and cannot be achieved simultaneously. More specifically, we identify and quantitatively characterize three regimes of RBM learning: independent learning, where the accuracy improves without losing efficiency; correlation learning, where higher accuracy entails lower efficiency; and degradation, where both accuracy and efficiency no longer improve or even deteriorate. These findings are based on numerical experiments and heuristic arguments.
翻译:受限制的Boltzmann机器(RBMs)为不受监督的机器学习提供了一个多功能结构,原则上可以任意精确地近似于任何目标概率分布;然而,成果管理制模式由于计算复杂,通常不能直接获得,而且利用Markov链抽样分析所学的概率分布。因此,对于培训和最终应用来说,最好有一个既准确又有效率的取样员。我们强调,这两个目标一般是相互竞争,不能同时实现。更具体地说,我们确定并量化了三种成果管理制学习制度:独立学习,其准确性在不降低效率的情况下提高;相关学习,其准确性意味着效率降低;以及退化,其准确性和效率不再改善甚至恶化。这些调查结果基于数字实验和超理论。