Optimal transport (OT) is known to be sensitive against outliers because of its marginal constraints. Outlier robust OT variants have been proposed based on the definition that outliers are samples which are expensive to move. In this paper, we show that this definition is restricted by considering the case where outliers are closer to the target measure than clean samples. We show that outlier robust OT fully transports these outliers leading to poor performances in practice. To tackle these outliers, we propose to detect them by relying on a classifier trained with adversarial training to classify source and target samples. A sample is then considered as an outlier if the prediction from the classifier is different from its assigned label. To decrease the influence of these outliers in the transport problem, we propose to either remove them from the problem or to increase the cost of moving them by using the classifier prediction. We show that we successfully detect these outliers and that they do not influence the transport problem on several experiments such as gradient flows, generative models and label propagation.
翻译:最佳运输(OT) 因其边际限制,已知对离子体具有敏感性。 外部强势的OT变异体是根据离子体是昂贵的移动样品的定义提出的。 在本文中,我们通过考虑离子体比清洁样品更接近目标测量的情况来显示这一定义受到限制。 我们显示, 外部强势的OT充分运输这些离子体导致实际性能不佳。 为了对付这些离子体, 我们提议通过依靠受过对抗训练的分类师来对来源和目标样品进行分类。 如果分类者的预测与指定的标签不同, 样本则被视为离子体。 为了减少这些离子体在运输问题上的影响, 我们提议要么从问题中排除它们, 要么通过使用分类预测来增加移动它们的成本。 我们表明,我们成功地检测了这些离子,它们不会影响诸如梯流、 组合模型和标签传播等若干实验的运输问题。