无悔恨的在线缓冲:通过建议进行乐观学习 (Online Caching with no Regret: Optimistic Learning via Recommendations)

The design of effective online caching policies is an increasingly important problem for content distribution networks, online social networks and edge computing services, among other areas. This paper proposes a new algorithmic toolbox for tackling this problem through the lens of optimistic online learning. We build upon the Follow-the-Regularized-Leader (FTRL) framework, which is developed further here to include predictions for the file requests, and we design online caching algorithms for bipartite networks with fixed-size caches or elastic leased caches subject to time-average budget constraints. The predictions are provided by a content recommendation system that influences the users viewing activity and hence can naturally reduce the caching network's uncertainty about future requests. We also extend the framework to learn and utilize the best request predictor in cases where many are available. We prove that the proposed {optimistic} learning caching policies can achieve sub-zero performance loss (regret) for perfect predictions, and maintain the sub-linear regret bound $O(\sqrt T)$, which is the best achievable bound for policies that do not use predictions, even for arbitrary-bad predictions. The performance of the proposed algorithms is evaluated with detailed trace-driven numerical tests.

翻译：有效的在线缓存政策的设计对于内容分发网络、在线社交网络和边际计算服务等领域来说是一个日益重要的问题。本文件提出一个新的算法工具箱,通过乐观的在线学习透镜来解决这一问题。我们以“追踪(Regulalized-Leader)”框架为基础,在此进一步制定该框架,以包括对文件请求的预测;我们为有固定大小缓存或弹性租赁缓存的固定规模的双方网络设计在线缓存算法(regret),但受预算平均时间限制。预测由内容建议系统提供,该系统影响用户观看活动,从而自然减少缓存网络对未来请求的不确定性。我们还扩展了框架,以学习和利用可供许多人使用的最佳请求预测器。我们证明拟议的{optic}缓存政策可以实现最精确的零度性能损失(regret),并保持亚线性遗憾($O(sqrt T)$,这是不使用详细预测的量化(甚至任意性地)政策的最佳可实现。