The classical algorithms for online learning and decision-making have the benefit of achieving the optimal performance guarantees, but suffer from computational complexity limitations when implemented at scale. More recent sophisticated techniques, which we refer to as oracle-efficient methods, address this problem by dispatching to an offline optimization oracle that can search through an exponentially-large (or even infinite) space of decisions and select that which performed the best on any dataset. But despite the benefits of computational feasibility, oracle-efficient algorithms exhibit one major limitation: while performing well in worst-case settings, they do not adapt well to friendly environments. In this paper we consider two such friendly scenarios, (a) "small-loss" problems and (b) IID data. We provide a new framework for designing follow-the-perturbed-leader algorithms that are oracle-efficient and adapt well to the small-loss environment, under a particular condition which we call approximability (which is spiritually related to sufficient conditions provided by Dud\'{i}k et al., [2020]). We identify a series of real-world settings, including online auctions and transductive online classification, for which approximability holds. We also extend the algorithm to an IID data setting and establish a "best-of-both-worlds" bound in the oracle-efficient setting.
翻译:在线学习和决策的经典算法具有实现最佳绩效保障的好处,但在大规模实施时受到计算复杂性的限制。 最新的尖端技术(我们称之为甲骨文效率方法)通过向离线优化或触角发送能够通过超大(甚至无限)决策空间搜索的离线优化或触角法来解决这个问题,并选择在任何数据集上表现最佳的算法。 尽管计算可行性的好处,但甲骨文效率算法显示出一个重大限制:在最坏情况下运行良好,但并不适应友好的环境。在本文件中,我们考虑了两种友好的假设,(a)“小额损失”问题和(b) IID数据。我们为设计后续-超低层领导算法提供了一个新的框架,这些算法能够通过指数快速(甚至无限)在任何数据集上搜索最优秀的计算空间。尽管计算可行性,但甲骨文效率算法(这在精神上与Dud\'{i}及他人提供的充分条件有关,它们并不适应友好的环境。 我们确定了一系列真实世界的设置环境,包括在线拍卖和跨层算法(也用于在线)的“升级和跨世界数据配置。