This study proposes a novel approach for solving the PU learning problem based on an anomaly-detection strategy. Latent encodings extracted from positive-labeled data are linearly combined to acquire new samples. These new samples are used as embeddings to increase the density of positive-labeled data and, thus, define a boundary that approximates the positive class. The further a sample is from the boundary the more it is considered as a negative sample. Once a set of negative samples is obtained, the PU learning problem reduces to binary classification. The approach, named Dens-PU due to its reliance on the density of positive-labeled data, was evaluated using benchmark image datasets, and state-of-the-art results were attained.
翻译:本研究提出了一种新的方法来解决基于异常检测策略的正负样本学习问题。从正样本数据中提取的潜在编码进行线性组合,以获取新样本。这些新样本用作嵌入,以增加正标记数据的密度,从而定义逼近正类的边界。样本距离边界越远,它被视为负样本的概率越高。一旦获得一组负样本,PU学习问题就会降低到二元分类问题。该方法被命名为Dens-PU,因为它依赖于正例标签数据的密度,并使用基准图像数据集进行评估,取得了最先进的结果。