使用位置偏差确定性排名列表的基于离线估计分数的逆向分数 (Inverse Propensity Score based offline estimator for deterministic ranking lists using position bias)

In this work, we present a novel way of computing IPS using a position-bias model for deterministic logging policies. This technique significantly widens the policies on which OPE can be used. We validate this technique using two different experiments on industry-scale data. The OPE results are clearly strongly correlated with the online results, with some constant bias. The estimator requires the examination model to be a reasonably accurate approximation of real user behaviour.

翻译：在这项工作中,我们提出了一种使用确定性伐木政策的位置偏差模型计算IPS的新方式。这种技术极大地扩大了OPE可以使用的政策范围。我们用两种不同的工业规模数据实验来验证这种技术。OPE的结果显然与在线结果密切相关,并有一些持续的偏差。估计数字要求测试模型对实际用户行为进行合理的准确近似。

相关内容

估计/估计量

关注 3

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

因果图，Causal Graphs，52页ppt

专知会员服务

252+阅读 · 2020年4月19日

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

专知会员服务

15+阅读 · 2019年10月23日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

ACM TOMM Call for Papers

CCF多媒体专委会

2+阅读 · 2022年3月23日

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Workshop

中国图象图形学学会CSIG

0+阅读 · 2021年12月20日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

中国图象图形学学会CSIG

0+阅读 · 2021年12月17日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

中国图象图形学学会CSIG

0+阅读 · 2021年11月10日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

中国图象图形学学会CSIG

0+阅读 · 2021年11月8日

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

Schr？dinger-Poisson方程守恒DDG方法研究

国家自然科学基金

2+阅读 · 2015年12月31日

开放量子系统非马尔科夫动力学过程量子仿真研究

国家自然科学基金

0+阅读 · 2014年12月31日

膜蛋白介导受IRES调控的cyclin B1促进食管癌转移的作用机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

Calderon问题和边界刚性问题

国家自然科学基金

0+阅读 · 2013年12月31日

基于全基因组测序基础上的芝麻茎点枯病菌（Macrophomina phaseolina）致病相关基因分析

国家自然科学基金

0+阅读 · 2013年12月31日

基于粗糙信息的多自主体编队控制

国家自然科学基金

4+阅读 · 2013年12月31日

基于视觉的打乒乓球机器人仿人击球策略研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于行为的SoS体系结构评价研究

国家自然科学基金

1+阅读 · 2012年12月31日

非线性系统和网络控制系统的迭代学习控制律的收敛性态研究

国家自然科学基金

0+阅读 · 2012年12月31日

多智能体系统的分布式动态覆盖控制

国家自然科学基金

5+阅读 · 2011年12月31日

$Contribution to the initialization of linear non-commensurate fractional-order systems for the joint estimation of parameters and fractional differentiation orders$

Contribution to the initialization of linear non-commensurate fractional-order systems for the joint estimation of parameters and fractional differentiation orders

Arxiv

0+阅读 · 2022年10月18日

Robust Reinforcement Learning using Offline Data

Arxiv

0+阅读 · 2022年10月18日

RAPO: An Adaptive Ranking Paradigm for Bilingual Lexicon Induction

Arxiv

0+阅读 · 2022年10月18日

LobsDICE: Offline Learning from Observation via Stationary Distribution Correction Estimation

Arxiv

0+阅读 · 2022年10月18日