Multi-label classification (MC) is a standard machine learning problem in which a data point can be associated with a set of classes. A more challenging scenario is given by hierarchical multi-label classification (HMC) problems, in which every prediction must satisfy a given set of hard constraints expressing subclass relationships between classes. In this paper, we propose C-HMCNN(h), a novel approach for solving HMC problems, which, given a network h for the underlying MC problem, exploits the hierarchy information in order to produce predictions coherent with the constraints and to improve performance. Furthermore, we extend the logic used to express HMC constraints in order to be able to specify more complex relations among the classes and propose a new model CCN(h), which extends C-HMCNN(h) and is again able to satisfy and exploit the constraints to improve performance. We conduct an extensive experimental analysis showing the superior performance of both C-HMCNN(h) and CCN(h) when compared to state-of-the-art models in both the HMC and the general MC setting with hard logical constraints.
翻译:多标签分类(MC)是一个标准的机器学习问题,其中数据点可以与一组类别联系起来。一个更具挑战性的情景是等级多标签分类(HMC)问题,其中每一项预测都必须满足表明各类别之间分级关系的一套硬性限制。在本文件中,我们提议C-HMCNN(h),这是解决HMC问题的一种新颖办法,鉴于对内在的MC问题有一个网络 h,我们利用等级信息得出与限制相一致的预测并改进性能。此外,我们扩大了用于表达HMC限制的逻辑,以便能够明确各类别之间更为复杂的关系,并提出新的CCN(h)模式,以扩展C-HMCNN(h),并再次能够满足和利用这些限制来改善业绩。我们进行了广泛的实验分析,显示C-HMCNN(h)和CCN(h)两种模式在与HMC和一般MC中具有困难逻辑限制的先进模型相比的优异性。