The rise of big data analytics has automated the decision-making of companies and increased supply chain agility. In this paper, we study the supply chain contract design problem faced by a data-driven supplier who needs to respond to the inventory decisions of the downstream retailer. Both the supplier and the retailer are uncertain about the market demand and need to learn about it sequentially. The goal for the supplier is to develop data-driven pricing policies with sublinear regret bounds under a wide range of possible retailer inventory policies for a fixed time horizon. To capture the dynamics induced by the retailer's learning policy, we first make a connection to non-stationary online learning by following the notion of variation budget. The variation budget quantifies the impact of the retailer's learning strategy on the supplier's decision-making. We then propose dynamic pricing policies for the supplier for both discrete and continuous demand. We also note that our proposed pricing policy only requires access to the support of the demand distribution, but critically, does not require the supplier to have any prior knowledge about the retailer's learning policy or the demand realizations. We examine several well-known data-driven policies for the retailer, including sample average approximation, distributionally robust optimization, and parametric approaches, and show that our pricing policies lead to sublinear regret bounds in all these cases. At the managerial level, we answer affirmatively that there is a pricing policy with a sublinear regret bound under a wide range of retailer's learning policies, even though she faces a learning retailer and an unknown demand distribution. Our work also provides a novel perspective in data-driven operations management where the principal has to learn to react to the learning policies employed by other agents in the system.
翻译:大数据分析器的崛起使公司的决策自动化,提高了供应链的灵活性。在本文中,我们研究了数据驱动的供应商面临的供应链合同设计问题,该供应商需要响应下游零售商的库存决定。供应商和零售商都对市场需求感到不确定,需要按顺序了解。供应商的目标是制定数据驱动定价政策,在一系列可能的零售商库存政策下,在固定的时间范围内进行亚线性遗憾限制。为了捕捉零售商学习政策引发的动态,我们首先研究数据驱动供应商面临的供应链合同设计问题,因为数据驱动供应商需要响应下游零售商的库存决定。我们研究了零售商学习战略对供应商决策的影响。我们提出的价格政策只是需要获得需求分布的支持,但批评地是,我们并不要求供应商事先对零售商学习政策或需求实现情况有任何未知的了解。我们研究了几个零售商学习策略的动态,我们通过模拟的零售商营销政策,我们从数据驱动政策向下展示了一种我们所熟知的动态的排序政策,我们从下向管理层学习。我们研究的是,我们所认识的零售商的零售商营销政策,我们从下,从数据驱动的排序政策向下学习了一种我们的数据驱动的亚。