Auctions with partially-revealed information about items are broadly employed in real-world applications, but the underlying mechanisms have limited theoretical support. In this work, we study a machine learning formulation of these types of mechanisms, presenting algorithms that are no-regret from the buyer's perspective. Specifically, a buyer who wishes to maximize his utility interacts repeatedly with a platform over a series of $T$ rounds. In each round, a new item is drawn from an unknown distribution and the platform publishes a price together with incomplete, "masked" information about the item. The buyer then decides whether to purchase the item. We formalize this problem as an online learning task where the goal is to have low regret with respect to a myopic oracle that has perfect knowledge of the distribution over items and the seller's masking function. When the distribution over items is known to the buyer and the mask is a SimHash function mapping $\mathbb{R}^d$ to $\{0,1\}^{\ell}$, our algorithm has regret $\tilde O((Td\ell)^{1/2})$. In a fully agnostic setting when the mask is an arbitrary function mapping to a set of size $n$ and the prices are stochastic, our algorithm has regret $\tilde O((Tn)^{1/2})$.
翻译:在现实世界应用中,对产品进行部分披露的拍卖信息被广泛用于实际应用,但基本机制的理论支持有限。在这项工作中,我们研究一种机器学习的这类机制的配方,从买主的角度介绍无回报的算法。具体地说,一个希望最大限度地扩大其效用的买主与一系列美元回合的平台反复互动。在每轮中,从一个未知的分发中抽出一个新项目,平台公布一个价格,同时公布一个不完整的、“虚装”的项目信息。买主然后决定是否购买该项目。我们将此问题正式化为在线学习任务,目标是对一个精通物品分配和卖方遮掩功能的近视镜或暗器表示低遗憾。当买主知道物品的分配情况时,面具是一个SimHash函数,绘制$\mathb{R ⁇ d$ to$0.1 ⁇ ell},我们的算法对O((Tell)\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\