We consider the classical uncoded caching problem from an online learning point-of-view. A cache of limited storage capacity can hold $C$ files at a time from a large catalog. A user requests an arbitrary file from the catalog at each time slot. Before the file request from the user arrives, a caching policy populates the cache with any $C$ files of its choice. In the case of a cache-hit, the policy receives a unit reward and zero rewards otherwise. In addition to that, there is a cost associated with fetching files to the cache, which we refer to as the switching cost. The objective is to design a caching policy that incurs minimal regret while considering both the rewards due to cache-hits and the switching cost due to the file fetches. The main contribution of this paper is the switching regret analysis of a Follow the Perturbed Leader-based anytime caching policy, which is shown to have an order optimal switching regret. In this pursuit, we improve the best-known switching regret bound for this problem by a factor of $\Theta(\sqrt{C}).$ We conclude the paper by comparing the performance of different popular caching policies using a publicly available trace from a commercial CDN server.
翻译:我们从在线学习的角度来考虑古典的未编码缓存问题。 有限的存储容量缓存能力可以同时从大型目录中保存 $C 的文件。 用户在每一个时间段都要求从目录中获取任意的文件。 在用户的文档请求到来之前, 缓存政策会用其选择的任何$C 的文档来填充缓存。 在缓存打击政策中, 该政策会得到单位奖赏和零奖励。 除此之外, 将文件拿回缓存, 我们称之为转换成本, 目的是设计一个缓存政策, 在考虑缓存后应得的缓存和文件取应得的切换费用的同时, 产生最低程度的遗憾。 本文的主要贡献是, 调试分析一个基于 Perturbed 领导人的缓存政策, 这表明该政策有一个最有序的转置后悔。 此外, 我们通过一个 $\ Theta{N} 可公开使用的服务器政策来改进这一问题最著名的转接的遗憾。 我们用一个 $\\\\\\\\ n\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\