The main difficulty that arises in the analysis of most machine learning algorithms is to handle, analytically and numerically, a large number of interacting random variables. In this Ph.D. manuscript, we revisit an approach based on the tools of the statistical physics of disordered systems. Developed through a rich literature, these tools have been designed precisely to infer the macroscopic behavior of a large number of particles from their microscopic interactions. At the heart of this work, we capitalize strongly on the deep connection between the replica method and message-passing algorithms in order to shed light on the phase diagrams of various theoretical models, with an emphasis on the potential differences between statistical and algorithmic thresholds. We focus essentially on synthetic tasks and data generated in the teacher-student paradigm. In particular, we apply these mean-field methods to the Bayes-optimal analysis of committee machines, to the worst-case analysis of Rademacher generalization bounds for perceptrons, and to empirical risk minimization in the context of generalized linear models. Finally, we develop a framework to analyze estimation models with structured prior information, produced for instance by generative models based on deep neural networks with random weights.