Auctions with partially-revealed information about items are broadly employed in real-world applications, but the underlying mechanisms have limited theoretical support. In this work, we study a machine learning formulation of these types of mechanisms, presenting algorithms that are no-regret from the buyer's perspective. Specifically, a buyer who wishes to maximize his utility interacts repeatedly with a platform over a series of $T$ rounds. In each round, a new item is drawn from an unknown distribution and the platform publishes a price together with incomplete, "masked" information about the item. The buyer then decides whether to purchase the item. We formalize this problem as an online learning task where the goal is to have low regret with respect to a myopic oracle that has perfect knowledge of the distribution over items and the seller's masking function. When the distribution over items is known to the buyer and the mask is a SimHash function mapping $\mathbb{R}^d$ to $\{0,1\}^{\ell}$, our algorithm has regret $\tilde {\mathcal{O}}((Td\ell)^{\frac{1}{2}})$. In a fully agnostic setting when the mask is an arbitrary function mapping to a set of size $n$, our algorithm has regret $\tilde {\mathcal{O}}(T^{\frac{3}{4}}n^{\frac{1}{2}})$. Finally, when the prices are stochastic, the algorithm has regret $\tilde {\mathcal{O}}((Tn)^{\frac{1}{2}})$.
翻译:含有部分反响的物品信息的拍卖 { 部分反射信息被广泛用于现实世界应用程序 { {, 但基础机制有有限的理论支持 。 在这项工作中, 我们研究一种机器学习的这些类型机制的配方。 我们研究一种机器学习的配方, 从买主的角度看, 呈现出一种无效果的算法。 具体地说, 买主希望将其效用最大化, 在一系列的 $T 回合中与一个平台进行多次互动。 每一轮中, 都会从未知的分发中抽出一个新项目, 平台公布一个价格与不完整的“ 伪造” 项目信息。 买主然后决定是否购买此项目。 我们将此问题正式化为在线学习任务, 目标是对一个对项目分布和卖方遮掩功能有完全了解的远端或暗的算法 。 当买主知道物品的分布是SimHash函数映射$\bb{Rd$ to $0, 1\\\ $, 1\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\