从相关性判决中学习到排名 (Learning to Rank from Relevance Judgments Distributions) - 专知论文

会员服务 ·

0

秩 · Performer · MoDELS · 学成 · 损失函数（机器学习） ·

2022 年 2 月 13 日

Learning to Rank from Relevance Judgments Distributions

翻译：从相关性判决中学习到排名

Alberto Purpura,Gianmaria Silvello,Gian Antonio Susto

Learning to Rank (LETOR) algorithms are usually trained on annotated corpora where a single relevance label is assigned to each available document-topic pair. Within the Cranfield framework, relevance labels result from merging either multiple expertly curated or crowdsourced human assessments. In this paper, we explore how to train LETOR models with relevance judgments distributions (either real or synthetically generated) assigned to document-topic pairs instead of single-valued relevance labels. We propose five new probabilistic loss functions to deal with the higher expressive power provided by relevance judgments distributions and show how they can be applied both to neural and GBM architectures. Moreover, we show how training a LETOR model on a sampled version of the relevance judgments from certain probability distributions can improve its performance when relying either on traditional or probabilistic loss functions. Finally, we validate our hypothesis on real-world crowdsourced relevance judgments distributions. Overall, we observe that relying on relevance judgments distributions to train different LETOR models can boost their performance and even outperform strong baselines such as LambdaMART on several test collections.

翻译：学习排名( LETOR) 算法通常在附加注释的Corpora (LETOR) 上接受培训, 每一个可用的文档专题配对都配有单一的相关标签。在 Cranfield 框架内, 关联标签的产生是因为将多种专家整理的人类评估或众源评估合并在一起。在本文中, 我们探索如何培训LETOR 模型, 配有用于文档专题配对( 无论是真实的还是合成生成的) 的相关判断分布, 而不是单一价值的关联标签。我们提议了五个新的概率损失功能, 以处理相关性判断分布所提供的更高表达力, 并展示它们如何同时适用于神经和 GBM 结构。此外, 我们展示了在依赖传统或概率分布的概率分配中, 如何用 LETOR 模型的样本化模型来培训其性能。最后, 我们验证了我们关于真实世界人群组合关联值相关判断分布的假设。总体而言, 我们观察到, 依赖关联性判断分布来培训不同的 LETOR 模型可以提高它们的性, 甚至超越一些测试收藏的强基线, 例如 LambdaMART 。

0

相关内容

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

【USC-Aaron Chan博士答辩Slides】可信自然语言处理机器解释的生成与利用, 242页ppt，Generating and Utilizing Machine Explanations for Trustworthy NLP

【USC-Aaron Chan博士答辩Slides】可信自然语言处理机器解释的生成与利用, 242页ppt，Generating and Utilizing Machine Explanations for Trustworthy NLP

专知会员服务

16+阅读 · 2022年3月13日

【简明书】机器学习用例书册，76页pdf，The Big Book of Machine Learning Use Cases

【简明书】机器学习用例书册，76页pdf，The Big Book of Machine Learning Use Cases

专知会员服务

67+阅读 · 2021年12月22日

最新《自监督表示学习》报告，70页ppt

最新《自监督表示学习》报告，70页ppt

专知会员服务

86+阅读 · 2020年12月22日

哥伦比亚大学最新《机器学习》课程，Fall-B 2020 (Machine Learning)

专知会员服务

39+阅读 · 2020年11月3日

【新书：机器学习简介】《A Concise Introduction to Machine Learning》by A.C. Faul (CRC 2019)

【新书：机器学习简介】《A Concise Introduction to Machine Learning》by A.C. Faul (CRC 2019)

专知会员服务

77+阅读 · 2020年2月8日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

最新BERT相关论文清单，BERT-related Papers

最新BERT相关论文清单，BERT-related Papers

专知会员服务

53+阅读 · 2019年9月29日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

中国图象图形学学会CSIG

0+阅读 · 2021年11月8日

【ICIG2021】Latest News & Announcements of the Plenary Talk2

【ICIG2021】Latest News & Announcements of the Plenary Talk2

中国图象图形学学会CSIG

0+阅读 · 2021年11月2日

【ICIG2021】Latest News & Announcements of the Plenary Talk1

【ICIG2021】Latest News & Announcements of the Plenary Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年11月1日

【ICIG2021】Latest News & Announcements of the Industry Talk2

【ICIG2021】Latest News & Announcements of the Industry Talk2

中国图象图形学学会CSIG

0+阅读 · 2021年7月29日

【ICIG2021】Latest News & Announcements of the Industry Talk1

【ICIG2021】Latest News & Announcements of the Industry Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年7月28日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

随机矩阵/数组形式高维数据的充分降维：统计理论、方法及其应用

国家自然科学基金

2+阅读 · 2013年12月31日

IgA肾病IgA1糖化缺陷及补体活化异常相关基因的精细定位及功能研究

国家自然科学基金

0+阅读 · 2012年12月31日

癌症的靶向基因 - 痘苗溶瘤病毒治疗策略

国家自然科学基金

1+阅读 · 2012年12月31日

复杂大化工过程的分布式广义预测控制研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于关联分析的野生毛花猕猴桃AsA富集相关基因发掘及功能解析

国家自然科学基金

0+阅读 · 2012年12月31日

整合常见和罕见变异进行肺癌风险预测的统计方法研究

国家自然科学基金

0+阅读 · 2012年12月31日

甘肃金鳟生长性状候选基因的关联分析及功能标记开发

国家自然科学基金

0+阅读 · 2011年12月31日

骨质疏松性骨折重要候选基因的关联研究

国家自然科学基金

0+阅读 · 2009年12月31日

融合GeneRank与机器学习方法实现小鼠生精过程基因筛选和功能预测

国家自然科学基金

0+阅读 · 2009年12月31日

Unscented卡尔曼滤波算法及其在通信中的应用

国家自然科学基金

0+阅读 · 2008年12月31日

FedChain: Chained Algorithms for Near-Optimal Communication Cost in Federated Learning

Arxiv

0+阅读 · 2022年4月20日

OneFlow: Redesign the Distributed Deep Learning Framework from Scratch

Arxiv

0+阅读 · 2022年4月19日

Distributed Learning of Deep Neural Networks using Independent Subnet Training

Arxiv

2+阅读 · 2022年4月18日

Selection of proposal distributions for multiple importance sampling

Arxiv

0+阅读 · 2022年4月18日

Joint Multi-view Unsupervised Feature Selection and Graph Learning

Arxiv

0+阅读 · 2022年4月18日

ZeroIn: Characterizing the Data Distributions of Commits in Software Repositories

Arxiv

0+阅读 · 2022年4月16日

The Role of Pretrained Representations for the OOD Generalization of Reinforcement Learning Agents

Arxiv

0+阅读 · 2022年4月16日

Testing distributional assumptions of learning algorithms

Arxiv

0+阅读 · 2022年4月14日

Bayesian Deep Learning for Graphs

Arxiv

23+阅读 · 2022年2月24日

Image-to-Image Retrieval by Learning Similarity between Scene Graphs

Arxiv

21+阅读 · 2020年12月29日

VIP会员

文章信息

相关主题

损失函数（机器学习）

相关VIP内容

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

【USC-Aaron Chan博士答辩Slides】可信自然语言处理机器解释的生成与利用, 242页ppt，Generating and Utilizing Machine Explanations for Trustworthy NLP

【USC-Aaron Chan博士答辩Slides】可信自然语言处理机器解释的生成与利用, 242页ppt，Generating and Utilizing Machine Explanations for Trustworthy NLP

专知会员服务

16+阅读 · 2022年3月13日

【简明书】机器学习用例书册，76页pdf，The Big Book of Machine Learning Use Cases

【简明书】机器学习用例书册，76页pdf，The Big Book of Machine Learning Use Cases

专知会员服务

67+阅读 · 2021年12月22日

最新《自监督表示学习》报告，70页ppt

最新《自监督表示学习》报告，70页ppt

专知会员服务

86+阅读 · 2020年12月22日

哥伦比亚大学最新《机器学习》课程，Fall-B 2020 (Machine Learning)

专知会员服务

39+阅读 · 2020年11月3日

【新书：机器学习简介】《A Concise Introduction to Machine Learning》by A.C. Faul (CRC 2019)

【新书：机器学习简介】《A Concise Introduction to Machine Learning》by A.C. Faul (CRC 2019)

专知会员服务

77+阅读 · 2020年2月8日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

最新BERT相关论文清单，BERT-related Papers

最新BERT相关论文清单，BERT-related Papers

专知会员服务

53+阅读 · 2019年9月29日

热门VIP内容

开通专知VIP会员享更多权益服务

【CMU博士论文】数据驱动决策中的激励、信息与不确定性

DGP双粒度提示框架：图增强大模型助力欺诈检测

【ICCV2025】ESSENTIAL：用于视频类增量学习的情景记忆与语义记忆整合

唯快不破：大型语言模型高效架构综述

相关资讯

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

中国图象图形学学会CSIG

0+阅读 · 2021年11月8日

【ICIG2021】Latest News & Announcements of the Plenary Talk2

【ICIG2021】Latest News & Announcements of the Plenary Talk2

中国图象图形学学会CSIG

0+阅读 · 2021年11月2日

【ICIG2021】Latest News & Announcements of the Plenary Talk1

【ICIG2021】Latest News & Announcements of the Plenary Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年11月1日

【ICIG2021】Latest News & Announcements of the Industry Talk2

【ICIG2021】Latest News & Announcements of the Industry Talk2

中国图象图形学学会CSIG

0+阅读 · 2021年7月29日

【ICIG2021】Latest News & Announcements of the Industry Talk1

【ICIG2021】Latest News & Announcements of the Industry Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年7月28日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

相关论文

FedChain: Chained Algorithms for Near-Optimal Communication Cost in Federated Learning

Arxiv

0+阅读 · 2022年4月20日

OneFlow: Redesign the Distributed Deep Learning Framework from Scratch

Arxiv

0+阅读 · 2022年4月19日

Distributed Learning of Deep Neural Networks using Independent Subnet Training

Arxiv

2+阅读 · 2022年4月18日

Selection of proposal distributions for multiple importance sampling

Arxiv

0+阅读 · 2022年4月18日

Joint Multi-view Unsupervised Feature Selection and Graph Learning

Arxiv

0+阅读 · 2022年4月18日

ZeroIn: Characterizing the Data Distributions of Commits in Software Repositories

Arxiv

0+阅读 · 2022年4月16日

The Role of Pretrained Representations for the OOD Generalization of Reinforcement Learning Agents

Arxiv

0+阅读 · 2022年4月16日

Testing distributional assumptions of learning algorithms

Arxiv

0+阅读 · 2022年4月14日

Bayesian Deep Learning for Graphs

Arxiv

23+阅读 · 2022年2月24日

Image-to-Image Retrieval by Learning Similarity between Scene Graphs

Arxiv

21+阅读 · 2020年12月29日

相关基金

随机矩阵/数组形式高维数据的充分降维：统计理论、方法及其应用

国家自然科学基金

2+阅读 · 2013年12月31日

IgA肾病IgA1糖化缺陷及补体活化异常相关基因的精细定位及功能研究

国家自然科学基金

0+阅读 · 2012年12月31日

癌症的靶向基因 - 痘苗溶瘤病毒治疗策略

国家自然科学基金

1+阅读 · 2012年12月31日

复杂大化工过程的分布式广义预测控制研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于关联分析的野生毛花猕猴桃AsA富集相关基因发掘及功能解析

国家自然科学基金

0+阅读 · 2012年12月31日

整合常见和罕见变异进行肺癌风险预测的统计方法研究

国家自然科学基金

0+阅读 · 2012年12月31日

甘肃金鳟生长性状候选基因的关联分析及功能标记开发

国家自然科学基金

0+阅读 · 2011年12月31日

骨质疏松性骨折重要候选基因的关联研究

国家自然科学基金

0+阅读 · 2009年12月31日

融合GeneRank与机器学习方法实现小鼠生精过程基因筛选和功能预测

国家自然科学基金

0+阅读 · 2009年12月31日

Unscented卡尔曼滤波算法及其在通信中的应用

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员