Modelling agent preferences has applications in a range of fields including economics and increasingly, artificial intelligence. These preferences are not always known and thus may need to be estimated from observed behavior, in which case a model is required to map agent preferences to behavior, also known as structural estimation. Traditional models are based on the assumption that agents are perfectly rational: that is, they perfectly optimize and behave in accordance with their own interests. Work in the field of behavioral game theory has shown, however, that human agents often make decisions that are imperfectly rational, and the field has developed models that relax the perfect rationality assumption. We apply models developed for predicting behavior towards estimating preferences and show that they outperform both traditional and commonly used benchmark models on data collected from human subjects. In fact, Nash equilibrium and its relaxation, quantal response equilibrium (QRE), can induce an inaccurate estimate of agent preferences when compared against ground truth. A key finding is that modelling non-strategic behavior, conventionally considered uniform noise, is important for estimating preferences. To this end, we introduce quantal-linear4, a rich non-strategic model. We also propose an augmentation to the popular quantal response equilibrium with a non-strategic component. We call this augmented model QRE+L0 and find an improvement in estimating values over the standard QRE. QRE+L0 allows for alternative models of non-strategic behavior in addition to quantal-linear4.
翻译:模拟剂偏好在一系列领域都有应用,包括经济学和越来越多的人工智能。这些偏好并不总是已知,因此可能需要根据观察到的行为来估计,在这种情况下,需要模型来绘制代理人对行为的偏好,也称为结构估计。传统模型所依据的假设是,代理人完全理性:即它们完全优化并符合其自身利益;但是,行为游戏理论领域的工作表明,人类代理人往往作出不完全合理的决定,而且实地开发了放松完美理性假设的模式。我们应用了预测行为模型来估计偏好,并表明它们超越了从人类主体收集的数据的传统和常用基准模型。事实上,纳什平衡及其放松、二次反应平衡(QRE)可以导致对代理人偏好作出不准确的估计,而与地面真理相比。一项关键发现是,模拟非战略行为(通常认为统一的噪音)对于估计偏好非常重要。我们为此又采用了宽度线4,一种丰富的非战略模型。我们还提议在从人类主体的二次反应中增加到非战略反应模式的基调度。我们提议,在战略模型中增加一种可选择的基度+战略模型。</s>