Inspired by neuronal diversity in the biological neural system, a plethora of studies proposed to design novel types of artificial neurons and introduce neuronal diversity into artificial neural networks. Recently proposed quadratic neuron, which replaces the inner-product operation in conventional neurons with a quadratic one, have achieved great success in many essential tasks. Despite the promising results of quadratic neurons, there is still an unresolved issue: \textit{Is the superior performance of quadratic networks simply due to the increased parameters or due to the intrinsic expressive capability?} Without clarifying this issue, the performance of quadratic networks is always suspicious. Additionally, resolving this issue is reduced to finding killer applications of quadratic networks. In this paper, with theoretical and empirical studies, we show that quadratic networks enjoy parametric efficiency, thereby confirming that the superior performance of quadratic networks is due to the intrinsic expressive capability. This intrinsic expressive ability comes from that quadratic neurons can easily represent nonlinear interaction, while it is hard for conventional neurons. Theoretically, we derive the approximation efficiency of the quadratic network over conventional ones in terms of real space and manifolds. Moreover, from the perspective of the Barron space, we demonstrate that there exists a functional space whose functions can be approximated by quadratic networks in a dimension-free error, but the approximation error of conventional networks is dependent on dimensions. Empirically, experimental results on synthetic data, classic benchmarks, and real-world applications show that quadratic models broadly enjoy parametric efficiency, and the gain of efficiency depends on the task.
翻译:在生物神经系统的神经多样性的启发下, 大量研究建议设计新型人工神经元并将神经多样性引入人造神经网络。 最近提出的二次神经神经, 以二次研究取代常规神经神经元的内产操作, 在许多基本任务中取得了巨大的成功。 尽管四面神经系取得了令人乐观的结果, 但仍有一个尚未解决的问题。 \ textit { 是四面网络的优异性能, 只是因为参数增加或内在的表达能力?} 不澄清这一问题, 二次网络的性能总是令人怀疑的。 此外, 解决该问题已经减少,以二次网络的致命应用。 在本论文中,通过理论和实验性研究,我们表明四面网络的超强性能是由于内在的表达能力。 这种内在的表达能力来自四面神经系可以很容易地代表非线性互动, 而对于常规神经系来说则很难。 理论上, 我们从二次网络的接近性效率的角度来发现二次网络在二次网络上的致命性应用。 在常规的轨道上,我们从空间的轨道上展示一个真实的轨道, 。</s>