We consider dynamic pricing strategies in a streamed longitudinal data set-up where the objective is to maximize, over time, the cumulative profit across a large number of customer segments. We consider a dynamic probit model with the consumers' preferences as well as price sensitivity varying over time. Building on the well-known finding that consumers sharing similar characteristics act in similar ways, we consider a global shrinkage structure, which assumes that the consumers' preferences across the different segments can be well approximated by a spatial autoregressive (SAR) model. In such a streamed longitudinal set-up, we measure the performance of a dynamic pricing policy via regret, which is the expected revenue loss compared to a clairvoyant that knows the sequence of model parameters in advance. We propose a pricing policy based on penalized stochastic gradient descent (PSGD) and explicitly characterize its regret as functions of time, the temporal variability in the model parameters as well as the strength of the auto-correlation network structure spanning the varied customer segments. Our regret analysis results not only demonstrate asymptotic optimality of the proposed policy but also show that for policy planning it is essential to incorporate available structural information as policies based on unshrunken models are highly sub-optimal in the aforementioned set-up.
翻译:我们考虑动态定价策略,其中目标是在一个大量客户细分中,在时间上最大化累计利润。我们考虑具有随时间变化的使用者喜好和价格敏感度的动态 probit 模型。基于众所周知的消费者具有相似特征的发现,我们考虑全局收缩结构,该结构假定不同细分中消费者的偏好可以很好地近似为空间自回归(SAR)模型。在这种流式的纵向数据设置中,我们通过遗憾来度量动态定价策略的性能,遗憾是与预先知道模型参数序列的千里眼相比的预期收入损失。我们提出了一种基于惩罚随机梯度下降(PSGD)的定价策略,并明确将其遗憾作为时间、模型参数的时间变化以及跨越多样化客户细分的自相关网络结构的功能进行刻画。我们的遗憾分析结果不仅证明了所提出策略的渐进最优性,还表明对于策略规划而言,将可用的结构信息纳入其中是至关重要的,因为基于未经收缩的模型的策略在上述设置中高度次优。