We consider a generalization of the classifier-based density-ratio estimation task to a quasiprobabilistic setting where probability densities can be negative. The problem with most loss functions used for this task is that they implicitly define a relationship between the optimal classifier and the target quasiprobabilistic density ratio which is discontinuous or not surjective. We address these problems by introducing a convex loss function that is well-suited for both probabilistic and quasiprobabilistic density ratio estimation. To quantify performance, an extended version of the Sliced-Wasserstein distance is introduced which is compatible with quasiprobability distributions. We demonstrate our approach on a real-world example from particle physics, of di-Higgs production in association with jets via gluon-gluon fusion, and achieve state-of-the-art results.
翻译:本文考虑将基于分类器的密度比估计任务推广至准概率场景,其中概率密度允许为负值。该任务中多数损失函数存在一个共同问题:它们隐含定义的最优分类器与目标准概率密度比之间的关系要么不连续,要么非满射。我们通过引入一种适用于概率与准概率密度比估计的凸损失函数来解决这些问题。为量化性能,本文提出了切片瓦瑟斯坦距离的扩展版本,该度量与准概率分布具有兼容性。我们在粒子物理学的实际案例——胶子-胶子融合过程中伴随喷流产生的双希格斯玻色子生成——中验证了所提方法,并取得了当前最优的结果。