Two-player zero-sum games of infinite duration and their quantitative versions are used in verification to model the interaction between a controller (Eve) and its environment (Adam). The question usually addressed is that of the existence (and computability) of a strategy for Eve that can maximize her payoff against any strategy of Adam. In this work, we are interested in strategies of Eve that minimize her regret, i.e. strategies that minimize the difference between her actual payoff and the payoff she could have achieved if she had known the strategy of Adam in advance. We give algorithms to compute the strategies of Eve that ensure minimal regret against an adversary whose choice of strategy is (i) unrestricted, (ii) limited to positional strategies, or (iii) limited to word strategies. We also establish relations between the latter version and other problems studied in the literature.
翻译:在核查中,使用无限时间的双玩零和游戏及其定量版本来模拟控制器(Eve)及其环境(Adam)之间的相互作用。通常处理的问题是夏娃战略的存在(和可计算性),这种战略能够使她对亚当的任何战略获得最大的回报。在这项工作中,我们感兴趣的是夏娃战略,这种战略可以最大限度地减少她的遗憾,即如果她事先知道亚当的战略,将实际报酬与她本可以实现的回报之间的差别降到最低。我们提供算法来计算夏娃战略,以确保对一个选择战略(一) 不受限制,(二) 限于定位战略,或(三) 限于文字战略的对手的最小遗憾。我们还在后者与文献研究的其他问题之间建立关系。