Autonomous pricing algorithms are increasingly influencing competition in digital markets; however, their behavior under realistic demand conditions remains largely unexamined. This paper offers a thorough analysis of four pricing algorithms -- Q-Learning, PSO, Double DQN, and DDPG -- across three classic duopoly models (Logit, Hotelling, Linear) and under various demand-shock regimes created by auto-regressive processes. By utilizing profit- and price-based collusion indices, we investigate how the interactions among algorithms, market structure, and stochastic demand collaboratively influence competitive outcomes. Our findings reveal that reinforcement-learning algorithms often sustain supra-competitive prices under stable demand, with DDPG demonstrating the most pronounced collusive tendencies. Demand shocks produce notably varied effects: Logit markets suffer significant performance declines, Hotelling markets remain stable, and Linear markets experience shock-induced profit inflation. Despite marked changes in absolute performance, the relative rankings of the algorithms are consistent across different environments. These results underscore the critical importance of market structure and demand uncertainty in shaping algorithmic competition, while also contributing to the evolving policy discussions surrounding autonomous pricing behavior.
翻译:自主定价算法正日益影响数字市场的竞争格局,然而其在现实需求条件下的行为仍鲜有研究。本文对四种定价算法——Q-Learning、PSO、Double DQN和DDPG——在三种经典双寡头模型(Logit、Hotelling、Linear)以及由自回归过程生成的不同需求冲击机制下进行了系统分析。通过采用基于利润和价格的合谋指数,我们探究了算法交互、市场结构与随机需求如何共同影响竞争结果。研究发现,强化学习算法在稳定需求下常能维持超竞争价格,其中DDPG表现出最显著的合谋倾向。需求冲击产生显著异质性影响:Logit市场出现明显的性能衰退,Hotelling市场保持稳定,而Linear市场则经历冲击引发的利润膨胀。尽管绝对性能发生显著变化,各算法的相对排名在不同环境中保持稳定。这些结果凸显了市场结构和需求不确定性在塑造算法竞争中的关键作用,同时为围绕自主定价行为的政策讨论提供了新的实证依据。