Coordinate-based neural representations have shown significant promise as an alternative to discrete, array-based representations for complex low dimensional signals. However, optimizing a coordinate-based network from randomly initialized weights for each new signal is inefficient. We propose applying standard meta-learning algorithms to learn the initial weight parameters for these fully-connected networks based on the underlying class of signals being represented (e.g., images of faces or 3D models of chairs). Despite requiring only a minor change in implementation, using these learned initial weights enables faster convergence during optimization and can serve as a strong prior over the signal class being modeled, resulting in better generalization when only partial observations of a given signal are available. We explore these benefits across a variety of tasks, including representing 2D images, reconstructing CT scans, and recovering 3D shapes and scenes from 2D image observations.
翻译:以协调为基础的神经表征显示,作为复杂低维信号的离散、基于阵列的表示方式的替代方法,有很大的希望。然而,从随机初始加权对每个新信号进行优化基于协调的网络效率低下。我们提议采用标准的元学习算法,根据所代表的信号基本类别(如脸部图像或3D制椅子模型)来学习这些完全连接的网络的初始加权参数。尽管在实施方面只需要稍作改动,但使用这些初始加权在优化过程中能够更快地趋同,并能够成为建模信号类之前的强力组合,从而在只对特定信号进行部分观测时,能够更全面地普及。我们探索了这些好处,包括代表2D图像、重建CT扫描和从2D图像观测中恢复3D形和场景。