We propose a novel formulation of the triplet objective function that improves metric learning without additional sample mining or overhead costs. Our approach aims to explicitly regularize the distance between the positive and negative samples in a triplet with respect to the anchor-negative distance. As an initial validation, we show that our method (called No Pairs Left Behind [NPLB]) improves upon the traditional and current state-of-the-art triplet objective formulations on standard benchmark datasets. To show the effectiveness and potential of NPLB on real-world complex data, we evaluate our approach on a large-scale healthcare dataset (UK Biobank), demonstrating that the embeddings learned by our model significantly outperform all other current representations on the downstream tasks tested. Additionally, we provide a new model-agnostic single-time health risk definition that, when used in tandem with the learned representations, achieves the most accurate prediction of subjects' future health complications. Our results indicate that NPLB is a simple yet effective framework for improving existing deep metric learning models, showcasing the potential implications of metric learning in more complex applications, especially in the biological and healthcare domains.
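As a rough illustration of the kind of objective described above, the sketch below augments a standard triplet hinge loss with a hypothetical penalty tying the positive-negative distance to the anchor-negative distance. The function name, the squared-difference form of the regularizer, and the `reg_weight` parameter are assumptions for illustration; this is not the exact NPLB formulation, which is not reproduced in the abstract.

```python
import numpy as np

def triplet_loss_with_pn_regularizer(anchor, positive, negative,
                                     margin=1.0, reg_weight=0.1):
    """Classic triplet loss plus an illustrative regularizer that
    encourages the positive-negative distance to track the
    anchor-negative distance. Hypothetical sketch only; not the
    exact NPLB objective."""
    d_ap = np.linalg.norm(anchor - positive)    # anchor-positive distance
    d_an = np.linalg.norm(anchor - negative)    # anchor-negative distance
    d_pn = np.linalg.norm(positive - negative)  # positive-negative distance
    base = max(d_ap - d_an + margin, 0.0)       # standard triplet hinge term
    reg = reg_weight * (d_pn - d_an) ** 2       # assumed penalty on the d_pn/d_an gap
    return base + reg
```

Note that the added term requires no extra sample mining: it reuses the same anchor, positive, and negative already present in each triplet, which matches the abstract's claim of no additional mining or overhead.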