We consider the problem of supervised classification, such that the features that the network extracts match an unseen set of semantic attributes, without any additional supervision. For example, when learning to classify images of birds into species, we would like to observe the emergence of features that zoologists use to classify birds. We propose training a neural network with discrete top-level activations, which is followed by a multi-layered perceptron (MLP) and a parallel decision tree. We present a theoretical analysis as well as a practical method for learning in the intersection of two hypothesis classes. Since real-world features are often sparse, a randomized sparsity regularization is also applied. Our results on multiple benchmarks show an improved ability to extract a set of features that are highly correlated with the set of unseen attributes.
翻译:我们考虑了监督分类的问题,例如网络在没有任何额外监督的情况下提取的特征与一组看不见的语义特征相匹配。例如,在学习将鸟类图像分类为物种时,我们想观察动物学家用来对鸟类分类的特征的出现。我们提议培训一个神经网络,其启动功能为离散的顶层,随后是多层感应器(MLP)和平行的决策树。我们提出了一个理论分析,以及在两个假设类交汇处学习的实用方法。由于现实世界特征往往很少,因此也应用随机随机的聚变规范。我们在多个基准上的结果显示,更有能力提取一系列与一组看不见属性高度相关的特征。