This work identifies and attempts to address a fundamental limitation of implicit neural representations with sinusoidal activation. The fitting error of SIRENs is highly sensitive to the target frequency content and to the choice of initialization. In extreme cases, this sensitivity leads to a spectral bottleneck that can result in a zero-valued output. This phenomenon is characterized by analyzing the evolution of activation spectra and the empirical neural tangent kernel (NTK) during the training process. An unfavorable distribution of energy across frequency modes was noted to give rise to this failure mode. Furthermore, the effect of Gaussian perturbations applied to the baseline uniformly initialized weights is examined, showing how these perturbations influence activation spectra and the NTK eigenbasis of SIREN. Overall, initialization emerges as a central factor governing the evolution of SIRENs, indicating the need for adaptive, target-aware strategies as the target length increases and fine-scale detail becomes essential. The proposed weight initialization scheme (WINNER) represents a simple ad hoc step in this direction and demonstrates that fitting accuracy can be significantly improved by modifying the spectral profile of network activations through a target-aware initialization. The approach achieves state-of-the-art performance on audio fitting tasks and yields notable improvements in image fitting tasks.
翻译:本研究识别并尝试解决具有正弦激活的隐式神经表示的一个基本限制。SIREN的拟合误差对目标频率内容及初始化选择高度敏感。在极端情况下,这种敏感性会导致频谱瓶颈,从而产生零值输出。通过分析训练过程中激活谱与经验神经正切核的演化,我们刻画了该现象的特征。研究发现,能量在频率模式间的不利分布是导致此失效模式的原因。此外,本文考察了高斯扰动对基线均匀初始化权重的影响,揭示了这些扰动如何影响SIREN的激活谱与NTK特征基。总体而言,初始化成为主导SIREN演化的核心因素,这表明随着目标长度增加和精细细节变得至关重要,需要采用自适应的、目标感知的策略。所提出的权重初始化方案WINNER代表了朝此方向的一个简单特设步骤,并证明通过目标感知的初始化改变网络激活的频谱轮廓,可显著提升拟合精度。该方法在音频拟合任务中实现了最先进的性能,并在图像拟合任务中取得了显著改进。