重新评估“分类和计数”量化方法 (Re-Assessing the "Classify and Count" Quantification Method)

from arxiv, This is the final version of the paper, identical to the one that is going to appear on the Proceedings of the 43rd European Conference on Information Retrieval (ECIR 2021)

Learning to quantify (a.k.a.\ quantification) is a task concerned with training unbiased estimators of class prevalence via supervised learning. This task originated with the observation that "Classify and Count" (CC), the trivial method of obtaining class prevalence estimates, is often a biased estimator, and thus delivers suboptimal quantification accuracy; following this observation, several methods for learning to quantify have been proposed that have been shown to outperform CC. In this work we contend that previous works have failed to use properly optimised versions of CC. We thus reassess the real merits of CC (and its variants), and argue that, while still inferior to some cutting-edge methods, they deliver near-state-of-the-art accuracy once (a) hyperparameter optimisation is performed, and (b) this optimisation is performed by using a true quantification loss instead of a standard classification-based loss. Experiments on three publicly available binary sentiment classification datasets support these conclusions.

翻译：学习量化(a.k.a.\量化)是一项任务,涉及通过监督的学习培训无偏见的班级流行估计员。这项任务源于以下观察,即“分类和计数”(CC)是获得班级流行率估计的琐碎方法,通常是一个偏颇的估测器,因此提供了低于最佳的量化准确度;根据这一观察,提出了数种学习量化方法,这些方法被证明优于CC。在这项工作中,我们争论的是,以往的工程没有适当优化使用CC的版本。因此,我们重新评估CC(及其变种)的真正优点,并争论说,虽然它们仍然低于某些尖端方法,但一旦(a) 进行了超参数优化,而且(b) 这种优化是通过使用真正的量化损失而不是标准的分类损失来完成的。对三种公开的二元感应感分类数据集的实验支持这些结论。