We introduce algorithms for online, full-information prediction that are competitive with contextual tree experts of unknown complexity, in both probabilistic and adversarial settings. We show that by incorporating a probabilistic framework of structural risk minimization into existing adaptive algorithms, we can robustly learn not only the presence of stochastic structure when it exists (leading to constant as opposed to $\mathcal{O}(\sqrt{T})$ regret), but also the correct model order. We thus obtain regret bounds that are competitive with the regret of an optimal algorithm that possesses strong side information about both the complexity of the optimal contextual tree expert and whether the process generating the data is stochastic or adversarial. These are the first constructive guarantees on simultaneous adaptivity to the model and the presence of stochasticity.
翻译:我们引入了在线全面信息预测的算法,这种算法在概率和对抗性环境下都与不明复杂背景的树专家具有竞争力。 我们表明,通过将结构风险最小化的概率框架纳入现有的适应性算法,我们不仅能够强有力地了解存在随机结构时是否存在(导致恒定而不是$mathcal{O}(sqrt{T})的遗憾),还可以了解正确的模式顺序。因此,我们获得了有竞争力的遗憾界限,因为最优的算法拥有关于最佳背景树专家复杂性和数据生成过程是随机或对抗性的有力侧边信息。这是同时适应模型和存在随机性的第一个建设性保障。