Previous literature on unsupervised learning has focused on designing structural priors with the aim of learning meaningful features, but without considering the description length of the learned representations, which is a direct and unbiased measure of model complexity. In this paper, we first introduce the $\varphi$ metric, which evaluates unsupervised models based on their reconstruction accuracy and the degree of compression of their internal representations. We then define two activation functions (Identity, ReLU) as baselines and three sparse activation functions (top-k absolutes, Extrema-Pool indices, Extrema) as candidate structures that minimize $\varphi$. Next, we present Sparsely Activated Networks (SANs), which consist of kernels with shared weights that, during encoding, are convolved with the input and then passed through a sparse activation function; during decoding, the same weights are convolved with the sparse activation map, and the partial reconstructions from each kernel are summed to reconstruct the input. We compare SANs using the five previously defined activation functions on a variety of datasets (Physionet, UCI-epilepsy, MNIST, FMNIST) and show that models selected using $\varphi$ have representations with small description length and consist of interpretable kernels.
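To make the sparse activation functions concrete, here is a minimal sketch of the top-k absolutes activation in PyTorch. The semantics are inferred from its name: keep the k activations with the largest absolute value along each similarity map and zero out the rest. The function name, tensor layout, and the choice to apply it per kernel are our assumptions, not taken from the paper.

```python
import torch

def topk_absolutes(x: torch.Tensor, k: int) -> torch.Tensor:
    """Hypothetical sketch of the 'top-k absolutes' sparse activation:
    keep the k largest-magnitude activations along the last axis and
    zero out everything else, yielding a sparse activation map."""
    _, idx = x.abs().topk(k, dim=-1)                   # positions of the k largest |x|
    mask = torch.zeros_like(x).scatter_(-1, idx, 1.0)  # 1.0 at kept positions
    return x * mask
```

The Identity and ReLU baselines named in the abstract impose no such sparsity constraint, which is why they serve only as reference points when minimizing $\varphi$.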
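The encode/decode pipeline described above can likewise be sketched end to end. The following is a minimal illustration, assuming 1-D input (as in the Physionet and UCI-epilepsy experiments) and reusing topk_absolutes from the sketch above; the class and parameter names (SAN1d, num_kernels, kernel_size, k) are ours, not the paper's.

```python
import torch
import torch.nn.functional as F

class SAN1d(torch.nn.Module):
    """Minimal sketch of a Sparsely Activated Network for 1-D signals,
    following the encode/decode description in the abstract."""

    def __init__(self, num_kernels: int = 4, kernel_size: int = 9, k: int = 8):
        super().__init__()
        # Shared weights: the same kernels are used for encoding and decoding.
        self.weights = torch.nn.Parameter(0.1 * torch.randn(num_kernels, 1, kernel_size))
        self.k = k

    def forward(self, x: torch.Tensor) -> torch.Tensor:  # x: (batch, 1, length)
        pad = self.weights.shape[-1] // 2
        # Encoding: convolve the input with each kernel, then sparsify.
        similarity = F.conv1d(x, self.weights, padding=pad)
        sparse = topk_absolutes(similarity, self.k)
        # Decoding: convolve the sparse activation map with the same weights;
        # conv_transpose1d sums the partial reconstructions over kernels.
        return F.conv_transpose1d(sparse, self.weights, padding=pad)


# Usage: reconstruct a batch of two signals of length 100.
x = torch.randn(2, 1, 100)
x_hat = SAN1d()(x)
assert x_hat.shape == x.shape
```

Under this sketch, each sample's internal representation reduces to num_kernels × k nonzero activations, which is the sense in which a small k yields a small description length for the representation.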