click模型非偏差度的离线评估指标 (An Offline Metric for the Debiasedness of Click Models) - 专知论文

会员服务 ·

0

偏差 · 评估指标 · 相关性 · 反事实学习排序 · 排序 ·

2023 年 4 月 19 日

An Offline Metric for the Debiasedness of Click Models

翻译：click模型非偏差度的离线评估指标

Romain Deffayet,Philipp Hager,Jean-Michel Renders,Maarten de Rijke

from arxiv, SIGIR23 - Full paper

A well-known problem when learning from user clicks are inherent biases prevalent in the data, such as position or trust bias. Click models are a common method for extracting information from user clicks, such as document relevance in web search, or to estimate click biases for downstream applications such as counterfactual learning-to-rank, ad placement, or fair ranking. Recent work shows that the current evaluation practices in the community fail to guarantee that a well-performing click model generalizes well to downstream tasks in which the ranking distribution differs from the training distribution, i.e., under covariate shift. In this work, we propose an evaluation metric based on conditional independence testing to detect a lack of robustness to covariate shift in click models. We introduce the concept of debiasedness and a metric for measuring it. We prove that debiasedness is a necessary condition for recovering unbiased and consistent relevance scores and for the invariance of click prediction under covariate shift. In extensive semi-synthetic experiments, we show that our proposed metric helps to predict the downstream performance of click models under covariate shift and is useful in an off-policy model selection setting.

翻译：学习用户点击时普遍存在固有偏差，比如位置或信任偏差等问题，这是一个公认的问题。click模型是从用户点击中提取信息的常见方法，例如网页搜索中的文档相关性或者评估点击偏差用于下游任务，例如因果反事实学习排序算法、广告投放或公平排序。最近的研究显示，社区中的现有评估实践不能保证良好性能的click模型在排名分布与训练分布不同即协变量转移时有良好的推广性。在这项工作中，我们提出了一种基于条件独立性检验的评估指标，以检测click模型在协变量转移方面缺乏鲁棒性。我们引入了“非偏差度”概念和度量方法。我们证明了非偏差度是恢复无偏和一致相关性分数的必要条件，且在协变量转移下点击预测的不变性得以保持。在大量半合成实验中，我们展示了我们提出的指标有助于预测click模型在协变量转移下的下游表现，并且在离线模型选择设置中非常有用。

0

相关内容

不可错过！杜克大学《因果推断》课程，全面讲述因果推理

不可错过！杜克大学《因果推断》课程，全面讲述因果推理

专知会员服务

52+阅读 · 2022年10月22日

【MIT-ICLR2022】在机器学习模型中注入公平性, Injecting fairness into machine-learning models

【MIT-ICLR2022】在机器学习模型中注入公平性, Injecting fairness into machine-learning models

专知会员服务

22+阅读 · 2022年3月7日

【AAAI 2022】使用点反馈与标准离线黑箱算法的在线影响力最大化问题

【AAAI 2022】使用点反馈与标准离线黑箱算法的在线影响力最大化问题

专知会员服务

14+阅读 · 2022年1月16日

【KDD2020-Tutorial】因果推理与稳定学习，Causal Inference and Stable Learning

【KDD2020-Tutorial】因果推理与稳定学习，Causal Inference and Stable Learning

专知会员服务

87+阅读 · 2020年8月28日

【MIT】反偏差对比学习，Debiased Contrastive Learning

【MIT】反偏差对比学习，Debiased Contrastive Learning

专知会员服务

91+阅读 · 2020年7月4日

【MIT】从视频物理系统进行因果发现，Causal Discovery in Physical Systems from Videos

【MIT】从视频物理系统进行因果发现，Causal Discovery in Physical Systems from Videos

专知会员服务

26+阅读 · 2020年7月4日

回顾机器学习公平的数学框架，Review of Mathematical frameworks for Fairness in Machine Learning

回顾机器学习公平的数学框架，Review of Mathematical frameworks for Fairness in Machine Learning

专知会员服务

38+阅读 · 2020年5月30日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

图与推荐

2+阅读 · 2022年11月2日

浅聊对比学习（Contrastive Learning）第一弹

浅聊对比学习（Contrastive Learning）第一弹

PaperWeekly

0+阅读 · 2022年6月10日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

19篇ICML2019论文摘录选读！

19篇ICML2019论文摘录选读！

专知

28+阅读 · 2019年4月28日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

函数数据变换模型及降维方法的研究

国家自然科学基金

1+阅读 · 2015年12月31日

变换结构方程模型的非参数贝叶斯分析

国家自然科学基金

4+阅读 · 2014年12月31日

迭代变化因素下基于二维H∞理论的迭代学习控制方法研究

国家自然科学基金

0+阅读 · 2013年12月31日

BER通路基因miRNA结合位点基因多态性与结直肠癌易感性的关联及功能研究

国家自然科学基金

0+阅读 · 2013年12月31日

含指标项的变换模型的估计与经验似然分析

国家自然科学基金

0+阅读 · 2012年12月31日

非参数变换模型的统计推断

国家自然科学基金

0+阅读 · 2012年12月31日

IRES调控EV71神经毒性的分子机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

正相协及缺失数据情形的经验似然推断

国家自然科学基金

0+阅读 · 2012年12月31日

Fgf19对耳蜗毛细胞发育调控机制的研究

国家自然科学基金

0+阅读 · 2011年12月31日

基于公式的数学搜索引擎的研究与开发

国家自然科学基金

0+阅读 · 2009年12月31日

Unlocking the Potential of Federated Learning for Deeper Models

Arxiv

0+阅读 · 2023年6月5日

Data Quality in Imitation Learning

Arxiv

0+阅读 · 2023年6月4日

Bayesian nonparametric modeling of latent partitions via Stirling-gamma priors

Arxiv

0+阅读 · 2023年6月4日

Auditing for Human Expertise

Arxiv

0+阅读 · 2023年6月2日

Semiparametric efficient estimation of genetic relatedness with machine learning methods

Arxiv

0+阅读 · 2023年6月2日

Byzantine-Robust Clustered Federated Learning

Arxiv

0+阅读 · 2023年6月1日

Nonparametric Identifiability of Causal Representations from Unknown Interventions

Arxiv

0+阅读 · 2023年6月1日

Time and Space Optimal Massively Parallel Algorithm for the 2-Ruling Set Problem

Arxiv

0+阅读 · 2023年6月1日

The Causal Learning of Retail Delinquency

Arxiv

15+阅读 · 2020年12月17日

Few-shot Learning with Meta Metric Learners

Arxiv

13+阅读 · 2019年1月26日

VIP会员

文章信息

相关主题

反事实学习排序

相关VIP内容

不可错过！杜克大学《因果推断》课程，全面讲述因果推理

不可错过！杜克大学《因果推断》课程，全面讲述因果推理

专知会员服务

52+阅读 · 2022年10月22日

【MIT-ICLR2022】在机器学习模型中注入公平性, Injecting fairness into machine-learning models

【MIT-ICLR2022】在机器学习模型中注入公平性, Injecting fairness into machine-learning models

专知会员服务

22+阅读 · 2022年3月7日

【AAAI 2022】使用点反馈与标准离线黑箱算法的在线影响力最大化问题

【AAAI 2022】使用点反馈与标准离线黑箱算法的在线影响力最大化问题

专知会员服务

14+阅读 · 2022年1月16日

【KDD2020-Tutorial】因果推理与稳定学习，Causal Inference and Stable Learning

【KDD2020-Tutorial】因果推理与稳定学习，Causal Inference and Stable Learning

专知会员服务

87+阅读 · 2020年8月28日

【MIT】反偏差对比学习，Debiased Contrastive Learning

【MIT】反偏差对比学习，Debiased Contrastive Learning

专知会员服务

91+阅读 · 2020年7月4日

【MIT】从视频物理系统进行因果发现，Causal Discovery in Physical Systems from Videos

【MIT】从视频物理系统进行因果发现，Causal Discovery in Physical Systems from Videos

专知会员服务

26+阅读 · 2020年7月4日

回顾机器学习公平的数学框架，Review of Mathematical frameworks for Fairness in Machine Learning

回顾机器学习公平的数学框架，Review of Mathematical frameworks for Fairness in Machine Learning

专知会员服务

38+阅读 · 2020年5月30日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

大语言模型智能体强化学习：全景综述

《城市滨海地区：理解复杂多变环境下的指挥控制框架》50页报告

【伯克利博士论文】从推理服务到训练：面向大规模 LLM 智能体的高效系统

美空军“顶点2025”实验：推进AI在C2、动态目标锁定与联盟集成中的应用

相关资讯

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

图与推荐

2+阅读 · 2022年11月2日

浅聊对比学习（Contrastive Learning）第一弹

浅聊对比学习（Contrastive Learning）第一弹

PaperWeekly

0+阅读 · 2022年6月10日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

19篇ICML2019论文摘录选读！

19篇ICML2019论文摘录选读！

专知

28+阅读 · 2019年4月28日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

相关论文

Unlocking the Potential of Federated Learning for Deeper Models

Arxiv

0+阅读 · 2023年6月5日

Data Quality in Imitation Learning

Arxiv

0+阅读 · 2023年6月4日

Bayesian nonparametric modeling of latent partitions via Stirling-gamma priors

Arxiv

0+阅读 · 2023年6月4日

Auditing for Human Expertise

Arxiv

0+阅读 · 2023年6月2日

Semiparametric efficient estimation of genetic relatedness with machine learning methods

Arxiv

0+阅读 · 2023年6月2日

Byzantine-Robust Clustered Federated Learning

Arxiv

0+阅读 · 2023年6月1日

Nonparametric Identifiability of Causal Representations from Unknown Interventions

Arxiv

0+阅读 · 2023年6月1日

Time and Space Optimal Massively Parallel Algorithm for the 2-Ruling Set Problem

Arxiv

0+阅读 · 2023年6月1日

The Causal Learning of Retail Delinquency

Arxiv

15+阅读 · 2020年12月17日

Few-shot Learning with Meta Metric Learners

Arxiv

13+阅读 · 2019年1月26日

相关基金

函数数据变换模型及降维方法的研究

国家自然科学基金

1+阅读 · 2015年12月31日

变换结构方程模型的非参数贝叶斯分析

国家自然科学基金

4+阅读 · 2014年12月31日

迭代变化因素下基于二维H∞理论的迭代学习控制方法研究

国家自然科学基金

0+阅读 · 2013年12月31日

BER通路基因miRNA结合位点基因多态性与结直肠癌易感性的关联及功能研究

国家自然科学基金

0+阅读 · 2013年12月31日

含指标项的变换模型的估计与经验似然分析

国家自然科学基金

0+阅读 · 2012年12月31日

非参数变换模型的统计推断

国家自然科学基金

0+阅读 · 2012年12月31日

IRES调控EV71神经毒性的分子机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

正相协及缺失数据情形的经验似然推断

国家自然科学基金

0+阅读 · 2012年12月31日

Fgf19对耳蜗毛细胞发育调控机制的研究

国家自然科学基金

0+阅读 · 2011年12月31日

基于公式的数学搜索引擎的研究与开发

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员