We consider the algorithmic problem of finding the optimal weights and biases for a two-layer fully connected neural network to fit a given set of data points. This problem is known as empirical risk minimization in the machine learning community. We show that the problem is $\exists\mathbb{R}$-complete. This complexity class can be defined as the set of algorithmic problems that are polynomial-time equivalent to finding real roots of a polynomial with integer coefficients. Furthermore, we show that arbitrary algebraic numbers are required as weights to be able to train some instances to optimality, even if all data points are rational. Our results hold even if the following restrictions are all added simultaneously.
$\bullet$ There are exactly two output neurons.
$\bullet$ There are exactly two input neurons.
$\bullet$ The data has only 13 different labels.
$\bullet$ The number of hidden neurons is a constant fraction of the number of data points.
$\bullet$ The target training error is zero.
$\bullet$ The ReLU activation function is used.
This shows that even very simple networks are difficult to train. The result explains why typical methods for $\mathsf{NP}$-complete problems, like mixed-integer programming or SAT-solving, cannot train neural networks to global optimality, unless $\mathsf{NP}=\exists\mathbb{R}$. We strengthen a recent result by Abrahamsen, Kleist and Miltzow [NeurIPS 2021].
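To make the problem statement concrete, the following is one standard way to formalize the training (empirical risk minimization) decision problem for the architecture described above; the precise loss function and encoding used in the paper may differ, and the squared loss below is an illustrative assumption (any loss that vanishes exactly on perfect fits yields the same zero-error question).

Given data points $(x_1, y_1), \dots, (x_n, y_n) \in \mathbb{Q}^2 \times \mathbb{Q}^2$ and a number $m$ of hidden neurons, decide whether there exist weights and biases $W^{(1)} \in \mathbb{R}^{m \times 2}$, $b^{(1)} \in \mathbb{R}^{m}$, $W^{(2)} \in \mathbb{R}^{2 \times m}$, $b^{(2)} \in \mathbb{R}^{2}$ such that the two-layer ReLU network
$$
f(x) \;=\; W^{(2)} \, \sigma\!\left(W^{(1)} x + b^{(1)}\right) + b^{(2)},
\qquad \sigma(z) = \max(z, 0) \text{ applied componentwise},
$$
attains training error zero, i.e.,
$$
\sum_{i=1}^{n} \bigl\lVert f(x_i) - y_i \bigr\rVert^2 \;=\; 0
\quad\Longleftrightarrow\quad
f(x_i) = y_i \ \text{ for all } i \in \{1, \dots, n\}.
$$
The restrictions listed above then say that this decision problem remains $\exists\mathbb{R}$-complete even with two inputs, two outputs, only 13 distinct labels among the $y_i$, and $m$ a constant fraction of $n$.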