We consider the problem of the extraction of semantic attributes, supervised only with classification labels. For example, when learning to classify images of birds into species, we would like to observe the emergence of features that zoologists use to classify birds. To tackle this problem, we propose training a neural network with discrete features in the last layer, which is followed by two heads: a multi-layered perceptron (MLP) and a decision tree. Since decision trees utilize simple binary decision stumps we expect those discrete features to obtain semantic meaning. We present a theoretical analysis as well as a practical method for learning in the intersection of two hypothesis classes. Our results on multiple benchmarks show an improved ability to extract a set of features that are highly correlated with the set of unseen attributes.
翻译:我们考虑提取语义属性的问题,仅以分类标签加以监督。例如,当学会将鸟类图像分类为物种时,我们想观察动物学家用来对鸟类进行分类的特征的出现。为了解决这一问题,我们提议培训一个在最后一层具有离散特征的神经网络,然后有两个头:多层感应器(MLP)和一棵决策树。由于决策树使用简单的二进制决定立方(MLP)和一棵决策树。由于我们期望这些离散的特征具有语义意义。我们提出了一个理论分析以及两个假设类交叉学习的实用方法。我们在多个基准上的结果显示,更有能力提取一系列与一组看不见属性高度相关的特征。