A common problem in the task of human-object interaction (HOI) detection is that numerous HOI classes have only a small number of labeled examples, resulting in training sets with a long-tailed distribution. The lack of positive labels can lead to low classification accuracy for these classes. Towards addressing this issue, we observe that there exist natural correlations and anti-correlations among human-object interactions. In this paper, we model the correlations as action co-occurrence matrices and present techniques to learn these priors and leverage them for more effective training, especially on rare classes. The efficacy of our approach is demonstrated experimentally, where the performance of our approach consistently improves over the state-of-the-art methods on both of the two leading HOI detection benchmark datasets, HICO-Det and V-COCO.
翻译:人体与物体相互作用(HOI)任务的一个常见问题是,许多HOI班级只有为数不多的贴标签的例子,导致培训组长期分发,缺乏正面标签可能导致这些班级的分类准确性低。在解决这一问题时,我们发现人类与物体相互作用之间存在自然关联和反关联。在本文中,我们将这些关联作为共同行动矩阵和现有技术来模拟,以学习这些前科,并利用这些前科进行更有效的培训,特别是在稀有班级上。我们的方法的有效性通过实验性的方式得到证明,我们的方法在HICO-Det和V-CO这两个主要HOI检测基准数据集方面不断改进了最先进的方法。