公平基于LP的在线拨款 (Fairer LP-based Online Allocation)

In this paper, we consider a Linear Program (LP)-based online resource allocation problem where a decision maker accepts or rejects incoming customer requests irrevocably in order to maximize expected revenue given limited resources. At each time, a new order/customer/bid is revealed with a request of some resource(s) and a reward. We consider a stochastic setting where all the orders are i.i.d. sampled from an unknown distribution. Such formulation contains many classic applications such as the canonical (quantity-based) network revenue management problem and the Adwords problem. Specifically, we study the objective of providing fairness guarantees while maintaining low regret. Our definition of fairness is that a fair online algorithm should treat similar agents/customers similarly and the decision made for similar individuals should be consistent over time. We define a fair offline solution as the analytic center of the offline optimal solution set, and propose a fair algorithm that uses an interior-point LP solver and dynamically detects unfair resource spending. Our algorithm can control cumulative unfairness (the cumulative deviation from the online solutions to the offline fair solution) on the scale of order $O(\log(T))$, while maintaining the regret to be bounded with dependency on $T$. Our approach do not formulate the fairness requirement as a constrain in the optimization instance, and instead we address the problem from the perspective of algorithm design. We get the desirable fairness guarantee without imposing any fairness constraint, and our regret result is strong for the reason that we evaluate the regret by comparing to the original objective value.

翻译：在本文中,我们考虑一个基于线性程序(LP)的在线资源分配问题,即决策者接受或拒绝来港客户的不可撤销的公平性要求,以便在有限的资源条件下最大限度地增加预期收入。每次,都会披露一个新的订单/客户/投标,同时提出一些资源要求和奖赏。我们考虑一个随机的设置,即所有订单都来自未知分布的样本。这种配置包含许多经典应用程序,如卡尼西亚(基于数量)网络收入管理问题和Adword问题。具体地说,我们研究提供公平保障的目标,同时保持低度的遗憾。我们的公平性定义是公平的在线算法,对类似的代理商/客户应同样对待,对类似的个人所作的决定应随着时间的推移一致。我们定义公平脱线解决方案,作为离线最佳解决方案的解析中心,提出公平性算法,使用内分点LP解决方案解决问题,动态地发现不公平的资源支出。我们的算法可以控制累积的不公平性(从在线解决方案到离线性合理性解决方案的累积性偏差,同时不以美元为我们的最佳依赖度,而以美元为我们的标准排序。