We consider the online caching problem for a cache of limited size. In a time-slotted system, a user requests one file from a large catalog in each slot. If the requested file is cached, the policy receives a unit reward and zero rewards otherwise. We show that a Follow the Perturbed Leader (FTPL)-based anytime caching policy is simultaneously regret-optimal for both adversarial and i.i.d. stochastic arrivals. Further, in the setting where there is a cost associated with switching the cached contents, we propose a variant of FTPL that is order-optimal with respect to time for both adversarial and stochastic arrivals and has a significantly better performance compared to FTPL with respect to the switching cost for stochastic arrivals. We also show that these results can be generalized to the setting where there are constraints on the frequency with which cache contents can be changed. Finally, we validate the results obtained on various synthetic as well as real-world traces.
翻译:我们考虑一个数量有限的缓存点的在线缓存问题。 在时间分布的系统中, 用户会从每个空档的大型目录中请求一个文件。 如果所请求的文件被缓存, 政策将获得单位奖励和零奖励。 我们显示, 以受困头领( FTPL) 为基础的随时缓存政策同时对敌对和i. i. id. 随机抵达者来说都是最遗憾的。 此外, 在调换缓存内容成本的环境下, 我们提出了一个FTPL 的变种, 它对对抗和随机抵达者的时间来说都是最符合命令的, 并且比 FTPL 在缓存抵达者的切换成本方面表现要好得多。 我们还表明, 这些结果可以被普遍化到对缓存内容可更改频率有限制的环境 。 最后, 我们验证了各种合成和真实世界痕迹的结果 。