Learning concepts that are consistent with human perception is important for Deep Neural Networks to win end-user trust. Post-hoc interpretation methods lack transparency in the feature representations learned by the models. This work proposes a guided learning approach with an additional concept layer in a CNN- based architecture to learn the associations between visual features and word phrases. We design an objective function that optimizes both prediction accuracy and semantics of the learned feature representations. Experiment results demonstrate that the proposed model can learn concepts that are consistent with human perception and their corresponding contributions to the model decision without compromising accuracy. Further, these learned concepts are transferable to new classes of objects that have similar concepts.
翻译:符合人类认知的学习概念对于深神经网络赢得最终用户信任十分重要; 热后解释方法在模型所学特征表述方面缺乏透明度; 这项工作提议采用有线电视新闻网架构中增加一个概念层的有指导的学习方法,在有线电视新闻网架构中增加一个概念层,以学习视觉特征和词组之间的联系; 我们设计一个客观功能,优化所学特征描述的预测准确性和语义; 实验结果显示,拟议的模型可以学习与人类认知相一致的概念,以及这些概念对示范决定的相应贡献,同时又不损害准确性; 此外,这些已学过的概念可转让给具有类似概念的新类型的物体。