Multi-agent market simulation is an effective tool to investigate the impact of various trading strategies in financial markets. One way of designing a trading agent in simulated markets is through reinforcement learning where the agent is trained to optimize its cumulative rewards (e.g., maximizing profits, minimizing risk, improving equitability). While the agent learns a rational policy that optimizes the reward function, in reality, human investors are sub-rational with their decisions often differing from the optimal. In this work, we model human sub-rationality as resulting from two possible causes: psychological bias and computational limitation. We first examine the relationship between investor profits and their degree of sub-rationality, and create hand-crafted market scenarios to intuitively explain the sub-rational human behaviors. Through experiments, we show that our models successfully capture human sub-rationality as observed in the behavioral finance literature. We also examine the impact of sub-rational human investors on market observables such as traded volumes, spread and volatility. We believe our work will benefit research in behavioral finance and provide a better understanding of human trading behavior.
翻译:多试剂市场模拟是调查金融市场各种贸易战略影响的有效工具。在模拟市场设计贸易代理物的一种方法,是强化学习,培训该代理物如何优化其累积收益(例如,利润最大化、风险最小化、公平性提高等 ) 。该代理物学习了合理政策,优化了奖励功能,但在现实中,人类投资者与其决定往往与最佳决定不同。在这项工作中,我们模拟了由两种可能原因导致的人类亚合理化:心理偏差和计算限制。我们首先研究投资者利润与其次级合理化程度之间的关系,并创造手工设计的市场情景,直截了当地解释非理性的人类行为。我们通过实验表明,我们的模型成功地捕捉了行为金融文献中观察到的人类亚理性。我们还审视了次级人类投资者对交易量、扩散和波动等市场观察的影响。我们相信,我们的工作将有利于行为融资的研究,并使人们更好地了解人类交易行为。