The propensity model introduced by Jain et al. 2016 has become a standard approach for dealing with missing and long-tail labels in extreme multi-label classification (XMLC). In this paper, we critically revise this approach showing that despite its theoretical soundness, its application in contemporary XMLC works is debatable. We exhaustively discuss the flaws of the propensity-based approach, and present several recipes, some of them related to solutions used in search engines and recommender systems, that we believe constitute promising alternatives to be followed in XMLC.
翻译:Jain等人(2016年)推出的偏好模式已成为处理极端多标签分类中缺失和长尾标签的标准方法(XMLC ) 。 在本文中,我们批判性地修订了这一方法,表明尽管其理论健全,但在当代XMLC作品中的应用是值得商榷的。 我们详尽地讨论了基于偏好的方法的缺陷,并提出了几种食谱,其中一些食谱与搜索引擎和建议系统所使用的解决方案有关,我们认为这构成了在XMLC中可以遵循的有希望的替代方法。