Recently, multi-layer perceptrons (MLPs) with ReLU activations have enabled new photo-realistic rendering techniques by encoding scene properties using their weights. For these models, termed coordinate based MLPs, sinusoidal encodings are necessary in allowing for convergence to the high frequency components of the signal due to their severe spectral bias. Previous work has explained this phenomenon using Neural Tangent Kernel (NTK) and Fourier analysis. However, the kernel regime does not expose the properties of the network that induce this behavior, and the Fourier decomposition is global, not allowing for insight on the network's local dynamics. A new interpretation of spectral bias directly through ReLU network computations would expose their limitations in dense settings, while providing a clearer explanation as to how this behavior emerges during the learning process. In this paper, we provide the first study of spectral bias in a coordinate based MLP through its activation regions and gradient descent dynamics, specifically using gradient confusion. We relate the confusion between inputs to the distinctiveness of their activation patterns, and find higher amounts of confusion when expressive power is limited. This leads to slower convergence to the high frequency components of the signal, which is magnified by the density of coordinates. Additionally, this method allows us to analyze the properties of the activation regions as spectral bias is reduced, in which we find distinct dynamics.
翻译:最近,使用 ReLU 启动的多层感应器(MLPs)通过使用其重量来编码显示场景特性,使得新的光现实化技术得以使用这些功能。对于这些模型,即基于坐标的 MLPs,由于严重的光谱偏差,对信号的高频组成部分的趋同是必要的。以前的工作用Neal Tangent Cernel(NTK)和Fourier(Fleier)的分析解释了这种现象。然而,内核系统并没有暴露引起这种行为的网络特性,而Fourier分解是全球性的,无法使人们洞察到网络的本地动态。通过RELU网络计算直接对光谱偏差进行新的解释,将暴露其在密度环境中的局限性,同时更清楚地解释在学习过程中这种行为是如何出现的。在本文中,我们通过基于MLP的协调,通过它的激活区域和梯度下位下位的下位性能动态,特别是使用梯度混淆,首次研究光谱偏差的偏差性。我们把投入的混淆与其激活模式的特性联系起来,而发现在高频谱系中,当显性能量的频率受限制时,我们发现更高的混乱程度,这种感光谱系的特性会使得这种感变变的特性变得较慢。