We address the problem of sequential prediction with expert advice in a non-stationary environment with long-term memory guarantees in the sense of Bousquet and Warmuth [4]. We give a linear-time algorithm that improves on the best known regret bounds [26]. This algorithm incorporates a relative entropy projection step. This projection is advantageous over previous weight-sharing approaches in that weight updates may come with implicit costs as in for example portfolio optimization. We give an algorithm to compute this projection step in linear time, which may be of independent interest.
翻译:我们用专家建议来解决在非静止环境中进行连续预测的问题,这种环境具有布斯克特和沃穆特[4] 意义上的长期内存保证。我们给出线性时间算法,改进最已知的遗憾界限[26]。这种算法包含一个相对的星盘投影步骤。这种预测比先前的权重分担方法更有利,因为更新权重可能带来隐含成本,例如组合优化。我们给出一种算法,用线性时间计算这一预测步骤,这可能具有独立的兴趣。