In this paper, I formalize intelligence measurement in games by introducing mechanisms that assign a real number -- interpreted as an intelligence score -- to each player in a game. This score quantifies the ex-post strategic ability of the players based on empirically observable information, such as the actions of the players, the game's outcome, strength of the players, and a reference oracle machine such as a chess-playing artificial intelligence system. Specifically, I introduce two main concepts: first, the Game Intelligence (GI) mechanism, which quantifies a player's intelligence in a game by considering not only the game's outcome but also the "mistakes" made during the game according to the reference machine's intelligence. Second, I define gamingproofness, a practical and computational concept of strategyproofness. To illustrate the GI mechanism, I apply it to an extensive dataset comprising over a billion chess moves, including over a million moves made by top 20 grandmasters in history. Notably, Magnus Carlsen emerges with the highest GI score among all world championship games included in the dataset. In machine-vs-machine games, the well-known chess engine Stockfish comes out on top.
翻译:本文通过引入一种为博弈中每位参与者分配实数的机制——该实数被解释为智能分数——来形式化博弈中的智能度量。该分数基于可经验观察的信息(如参与者的行动、博弈结果、参与者实力以及参考预言机系统(例如国际象棋人工智能系统))量化参与者的后验策略能力。具体而言,我引入了两个核心概念:首先是博弈智能(GI)机制,该机制不仅考虑博弈结果,还依据参考机器的智能水平考量参与者在博弈过程中所犯的“错误”,从而量化其智能表现。其次,我定义了防博弈操纵性——一种实用且可计算的策略防操纵性概念。为阐释GI机制,我将其应用于一个包含超过十亿步国际象棋走法的庞大数据集,其中涵盖历史上排名前20位的特级大师所走的超过百万步棋。值得注意的是,在数据集中包含的所有世界冠军赛对局中,马格努斯·卡尔森获得了最高的GI分数。在机器对机器博弈中,著名的国际象棋引擎Stockfish位居榜首。