This work aims to address the long-established problem of learning diversified representations. To this end, we combine information-theoretic arguments with stochastic competition-based activations, namely Stochastic Local Winner-Takes-All (LWTA) units. In this context, we ditch the conventional deep architectures commonly used in Representation Learning, that rely on non-linear activations; instead, we replace them with sets of locally and stochastically competing linear units. In this setting, each network layer yields sparse outputs, determined by the outcome of the competition between units that are organized into blocks of competitors. We adopt stochastic arguments for the competition mechanism, which perform posterior sampling to determine the winner of each block. We further endow the considered networks with the ability to infer the sub-part of the network that is essential for modeling the data at hand; we impose appropriate stick-breaking priors to this end. To further enrich the information of the emerging representations, we resort to information-theoretic principles, namely the Information Competing Process (ICP). Then, all the components are tied together under the stochastic Variational Bayes framework for inference. We perform a thorough experimental investigation for our approach using benchmark datasets on image classification. As we experimentally show, the resulting networks yield significant discriminative representation learning abilities. In addition, the introduced paradigm allows for a principled investigation mechanism of the emerging intermediate network representations.
翻译:这项工作旨在解决长期存在的学习多样化代表制的问题。 为此,我们将信息理论理论与基于竞争的激励机制(即Stochastestic local Winner-Takes-All(LWTA)单位)结合起来。在这方面,我们抛弃了代表性学习中通常使用的传统深层架构,这些架构依赖于非线性激活;相反,我们用当地和随机竞争的线性单位来取代这些架构。在这一背景下,每个网络层产生稀少的产出,这取决于组成竞争对手集团的单位之间的竞争结果。我们采用竞争机制的质疑性论据,进行事后抽样抽样,以确定每个街区的胜出者。我们进一步缩小了被考虑过的网络,使之有能力推断出对模拟手头数据至关重要的网络子部分;我们为此设置了适当的突破性前台。为了进一步丰富新兴代表制的信息,我们诉诸信息理论性原则,即信息比较进程(IPC),然后,所有组成部分都结合竞争机制,进行后台式抽样取样。我们用一个实验性分析性网络的模型性分析能力,我们用一个实验性分析性模型性分析能力来研究。