Price discrimination, which refers to the strategy of setting different prices for different customer groups, has been widely used in online retailing. Although it helps boost the collected revenue for online retailers, it might create serious concern in fairness, which even violates the regulation and law. This paper studies the problem of dynamic discriminatory pricing under fairness constraints. In particular, we consider a finite selling horizon of length $T$ for a single product with two groups of customers. Each group of customers has its unknown demand function that needs to be learned. For each selling period, the seller determines the price for each group and observes their purchase behavior. While existing literature mainly focuses on maximizing revenue, ensuring fairness among different customers has not been fully explored in the dynamic pricing literature. In this work, we adopt the fairness notion from (Cohen et al. 2021a). For price fairness, we propose an optimal dynamic pricing policy in terms of regret, which enforces the strict price fairness constraint. In contrast to the standard $\sqrt{T}$-type regret in online learning, we show that the optimal regret in our case is $\tilde{\Theta}(T^{4/5})$. We further extend our algorithm to a more general notion of fairness, which includes demand fairness as a special case. To handle this general class, we propose a soft fairness constraint and develop the dynamic pricing policy that achieves $\tilde{O}(T^{4/5})$ regret.
翻译:价格歧视指的是为不同客户群体设定不同价格的战略,在网上零售中广泛使用。尽管它有助于增加网上零售商的税收,但可能会引起严重的公平问题,甚至违反法规和法律。本文研究了公平制约下动态歧视性定价的问题。特别是,我们考虑为单一产品和两组客户设定一个固定的销售期限,其长度为$T$,每组客户都有其未知的需求功能,需要学习。对于每个销售期,卖方决定每个集团的价格并观察其购买行为。虽然现有文献主要侧重于收入最大化,但动态定价文献并未充分探讨确保不同客户之间的公平。在这项工作中,我们采纳了(Cohen等人,2021a)的公平概念。关于价格公平,我们从遗憾的角度提出一个最佳的动态价格政策,要求严格的价格公平。 与在线学习中的标准$Qrt{T} 类型令人遗憾,我们显示我们案件中的最佳遗憾是$\tilde{T}(T ⁇ 4/5}确保不同客户之间的公平性。在动态定价文献中,我们采纳了公平性概念,我们进一步提出一个总体的公平性要求。