Online Joint Assortment-Inventory Optimization under MNL Choices - 专知 Papers


Oracle · Online · Algorithms · Optimization problems · Unknown parameters

April 4, 2023

Online Joint Assortment-Inventory Optimization under MNL Choices


Yong Liang,Xiaojie Mao,Shiyuan Wang

We study an online joint assortment-inventory optimization problem in which each customer's choice behavior follows the Multinomial Logit (MNL) choice model and the attraction parameters are unknown a priori. The retailer makes periodic assortment and inventory decisions, dynamically learning the attraction parameters from the realized demands while maximizing the expected total profit over time. In this paper, we propose a novel algorithm that effectively balances exploration and exploitation in the online decision-making of assortment and inventory. Our algorithm builds on a new estimator for the MNL attraction parameters, a novel approach to incentivizing exploration by adaptively tuning certain known and unknown parameters, and an optimization oracle for static single-cycle assortment-inventory planning problems with given parameters. We establish a regret upper bound for our algorithm and a lower bound for the online joint assortment-inventory optimization problem, suggesting that our algorithm achieves a nearly optimal regret rate, provided that the static optimization oracle is exact. We then incorporate more practical approximate static optimization oracles into our algorithm and bound from above the impact of static optimization errors on its regret. Finally, we perform numerical studies to demonstrate the effectiveness of the proposed algorithm.
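For readers unfamiliar with the MNL choice model named in the abstract, the following is a minimal sketch of the standard textbook form, in which a customer picks product i from an offered set S with probability v_i / (1 + sum of v_j over S) and the no-purchase option has attraction normalized to 1. The function names, the dictionary representation, and the per-unit margins are illustrative assumptions; this is not the paper's estimator or algorithm, and it ignores inventory limits and stockouts.

```python
def mnl_choice_probabilities(attractions, assortment):
    """Standard MNL choice probabilities for an offered assortment.

    attractions: dict mapping product -> positive attraction parameter v_i
    assortment:  iterable of offered products (a subset of the keys)
    The no-purchase option is keyed by None, with attraction fixed to 1
    (a common normalization, assumed here).
    """
    offered = list(assortment)
    denom = 1.0 + sum(attractions[i] for i in offered)
    probs = {i: attractions[i] / denom for i in offered}
    probs[None] = 1.0 / denom  # probability that the customer buys nothing
    return probs


def expected_profit_per_customer(attractions, margins, assortment):
    """Expected profit from one customer, assuming unlimited stock.

    margins: dict mapping product -> per-unit profit (hypothetical input).
    """
    offered = list(assortment)
    probs = mnl_choice_probabilities(attractions, offered)
    return sum(margins[i] * probs[i] for i in offered)


if __name__ == "__main__":
    v = {"a": 1.0, "b": 2.0}   # illustrative attraction parameters
    r = {"a": 4.0, "b": 2.0}   # illustrative per-unit margins
    print(mnl_choice_probabilities(v, ["a", "b"]))
    print(expected_profit_per_customer(v, r, ["a", "b"]))
```

In the online setting described above, the attraction parameters are not known and would have to be estimated from realized demand; the sketch only shows how choice probabilities follow once the parameters are given.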


