速率-最佳环境环境在线匹配强盗 (Rate-Optimal Contextual Online Matching Bandit) - 专知论文

会员服务 ·

0

赌博机/老虎机 · INFORMS · 在线 · 讲稿 · 稳健性 ·

2022 年 5 月 7 日

Rate-Optimal Contextual Online Matching Bandit

翻译：速率-最佳环境环境在线匹配强盗

Yuantong Li,Chi-hua Wang,Guang Cheng,Will Wei Sun

from arxiv, 43 pages, 9 figures

Two-sided online matching platforms have been employed in various markets. However, agents' preferences in present market are usually implicit and unknown and must be learned from data. With the growing availability of side information involved in the decision process, modern online matching methodology demands the capability to track preference dynamics for agents based on their contextual information. This motivates us to consider a novel Contextual Online Matching Bandit prOblem (COMBO), which allows dynamic preferences in matching decisions. Existing works focus on multi-armed bandit with static preference, but this is insufficient: the two-sided preference changes as along as one-side's contextual information updates, resulting in non-static matching. In this paper, we propose a Centralized Contextual - Explore Then Commit (CC-ETC) algorithm to adapt to the COMBO. CC-ETC solves online matching with dynamic preference. In theory, we show that CC-ETC achieves a sublinear regret upper bound O(log(T)) and is a rate-optimal algorithm by proving a matching lower bound. In the experiments, we demonstrate that CC-ETC is robust to variant preference schemes, dimensions of contexts, reward noise levels, and contexts variation levels.

翻译：在不同市场上采用了双面在线匹配平台。但是,当前市场的代理商偏好通常是隐含的和未知的,必须从数据中学习。随着决策过程中的侧面信息越来越多,现代在线匹配方法要求具备根据背景信息跟踪代理商偏好动态的能力。这促使我们考虑一种新的环境在线匹配大盗大盗大盗大盗大盗大案(COMBO),允许在匹配决定中提供动态偏好。现有的工程侧重于具有静态偏好的多臂强盗,但这还不够:在单面背景信息更新的同时,双面偏好变化,导致非静态匹配。在本文中,我们提出了一种中央化背景 — 探索(CC- ETC) 算法,以适应COMBO。 CC- ETC 解决在线匹配动态偏好的问题。在理论上,我们显示CC- ETC 实现了亚线性遗憾高约束O(log(T) ), 并且是一种比率- 最佳算法, 证明匹配较低约束。在实验中, 我们证明CC-ETC 强于变式的优惠计划、范围、背景环境、和奖励等级。

0

相关内容

赌博机/老虎机

赌博机/老虎机

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

76+阅读 · 2022年6月28日

50+篇《神经架构搜索NAS》2020论文合集

专知会员服务

61+阅读 · 2020年3月19日

【新书：机器学习简介】《A Concise Introduction to Machine Learning》by A.C. Faul (CRC 2019)

【新书：机器学习简介】《A Concise Introduction to Machine Learning》by A.C. Faul (CRC 2019)

专知会员服务

77+阅读 · 2020年2月8日

UC.Berkeley CS189讲义教材:《机器学习全面指南》，185页pdf

专知会员服务

162+阅读 · 2020年1月16日

【机器学习基础最新版】（Mathematics for Machine Learning），417页pdf

【机器学习基础最新版】（Mathematics for Machine Learning），417页pdf

专知会员服务

244+阅读 · 2019年10月21日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

中国图象图形学学会CSIG

0+阅读 · 2021年12月17日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

中国图象图形学学会CSIG

0+阅读 · 2021年11月10日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

中国图象图形学学会CSIG

0+阅读 · 2021年11月3日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

Waardenburg综合征的拷贝数变异检测及其致病机制的研究

国家自然科学基金

0+阅读 · 2015年12月31日

东亚—北美间断分布草本菝葜类群的物种形成和谱系地理学研究

国家自然科学基金

0+阅读 · 2015年12月31日

结核感染人群中IL-22+ T细胞亚群的免疫学特征及其TCR-CDR3谱型分析

国家自然科学基金

0+阅读 · 2013年12月31日

表面吸附和金属插层的石墨烯超导电性的理论研究

国家自然科学基金

0+阅读 · 2013年12月31日

炎症共刺激分子CD137-CD137L调控NFATc1启动粥样斑块钙化的机制

国家自然科学基金

0+阅读 · 2012年12月31日

4f和3d电子调控下的新型In和Te基稀土1：3型半导体化合物的磁输运和结构

国家自然科学基金

0+阅读 · 2012年12月31日

检测SF6分解特征组分的复合掺杂TiO2纳米管气敏传感器研究

国家自然科学基金

0+阅读 · 2012年12月31日

MiR-449介导KDM4C-Notch通路在三阴性乳腺癌增殖转移中的调控研究

国家自然科学基金

0+阅读 · 2012年12月31日

CuInS2量子点敏化纳米TiO2太阳电池的界面电子复合机理研究

国家自然科学基金

0+阅读 · 2010年12月31日

基于ITO的铁磁体/半导体复合结构的自旋相关输运

国家自然科学基金

0+阅读 · 2009年12月31日

Cost-Efficient Distributed Learning via Combinatorial Multi-Armed Bandits

Arxiv

0+阅读 · 2022年6月28日

Distributed Bayesian Online Learning for Cooperative Manipulation

Arxiv

0+阅读 · 2022年6月28日

Supply-Side Equilibria in Recommender Systems

Arxiv

0+阅读 · 2022年6月27日

Learning to Anticipate Future with Dynamic Context Removal

Arxiv

0+阅读 · 2022年6月27日

Beating Greedy Matching in Sublinear Time

Arxiv

0+阅读 · 2022年6月27日

AIR-Net: Adaptive and Implicit Regularization Neural Network for Matrix Completion

Arxiv

0+阅读 · 2022年6月27日

Risk-averse Contextual Multi-armed Bandit Problem with Linear Payoffs

Arxiv

0+阅读 · 2022年6月24日

Cyclic Graph Attentive Match Encoder (CGAME): A Novel Neural Network For OD Estimation

Arxiv

0+阅读 · 2022年6月24日

Geometric Policy Iteration for Markov Decision Processes

Arxiv

0+阅读 · 2022年6月24日

On making optimal transport robust to all outliers

Arxiv

0+阅读 · 2022年6月23日

VIP会员

文章信息

相关主题

赌博机/老虎机

相关VIP内容

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

76+阅读 · 2022年6月28日

50+篇《神经架构搜索NAS》2020论文合集

专知会员服务

61+阅读 · 2020年3月19日

【新书：机器学习简介】《A Concise Introduction to Machine Learning》by A.C. Faul (CRC 2019)

【新书：机器学习简介】《A Concise Introduction to Machine Learning》by A.C. Faul (CRC 2019)

专知会员服务

77+阅读 · 2020年2月8日

UC.Berkeley CS189讲义教材:《机器学习全面指南》，185页pdf

专知会员服务

162+阅读 · 2020年1月16日

【机器学习基础最新版】（Mathematics for Machine Learning），417页pdf

【机器学习基础最新版】（Mathematics for Machine Learning），417页pdf

专知会员服务

244+阅读 · 2019年10月21日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

用于无人机的C波段空地通信系统研究 | 2025最新116页

甚高频军事战术通信系统传播性能分析研究

军事通信系统：安全行动的支柱

卫星与地面通信系统：美陆军面临的空间与电子战局势 | 39页报告

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

中国图象图形学学会CSIG

0+阅读 · 2021年12月17日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

中国图象图形学学会CSIG

0+阅读 · 2021年11月10日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

中国图象图形学学会CSIG

0+阅读 · 2021年11月3日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

相关论文

Cost-Efficient Distributed Learning via Combinatorial Multi-Armed Bandits

Arxiv

0+阅读 · 2022年6月28日

Distributed Bayesian Online Learning for Cooperative Manipulation

Arxiv

0+阅读 · 2022年6月28日

Supply-Side Equilibria in Recommender Systems

Arxiv

0+阅读 · 2022年6月27日

Learning to Anticipate Future with Dynamic Context Removal

Arxiv

0+阅读 · 2022年6月27日

Beating Greedy Matching in Sublinear Time

Arxiv

0+阅读 · 2022年6月27日

AIR-Net: Adaptive and Implicit Regularization Neural Network for Matrix Completion

Arxiv

0+阅读 · 2022年6月27日

Risk-averse Contextual Multi-armed Bandit Problem with Linear Payoffs

Arxiv

0+阅读 · 2022年6月24日

Cyclic Graph Attentive Match Encoder (CGAME): A Novel Neural Network For OD Estimation

Arxiv

0+阅读 · 2022年6月24日

Geometric Policy Iteration for Markov Decision Processes

Arxiv

0+阅读 · 2022年6月24日

On making optimal transport robust to all outliers

Arxiv

0+阅读 · 2022年6月23日

相关基金

Waardenburg综合征的拷贝数变异检测及其致病机制的研究

国家自然科学基金

0+阅读 · 2015年12月31日

东亚—北美间断分布草本菝葜类群的物种形成和谱系地理学研究

国家自然科学基金

0+阅读 · 2015年12月31日

结核感染人群中IL-22+ T细胞亚群的免疫学特征及其TCR-CDR3谱型分析

国家自然科学基金

0+阅读 · 2013年12月31日

表面吸附和金属插层的石墨烯超导电性的理论研究

国家自然科学基金

0+阅读 · 2013年12月31日

炎症共刺激分子CD137-CD137L调控NFATc1启动粥样斑块钙化的机制

国家自然科学基金

0+阅读 · 2012年12月31日

4f和3d电子调控下的新型In和Te基稀土1：3型半导体化合物的磁输运和结构

国家自然科学基金

0+阅读 · 2012年12月31日

检测SF6分解特征组分的复合掺杂TiO2纳米管气敏传感器研究

国家自然科学基金

0+阅读 · 2012年12月31日

MiR-449介导KDM4C-Notch通路在三阴性乳腺癌增殖转移中的调控研究

国家自然科学基金

0+阅读 · 2012年12月31日

CuInS2量子点敏化纳米TiO2太阳电池的界面电子复合机理研究

国家自然科学基金

0+阅读 · 2010年12月31日

基于ITO的铁磁体/半导体复合结构的自旋相关输运

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员