Electricity systems are key to transforming today's society into a carbon-free economy. Long-term electricity market mechanisms, including auctions, support schemes, and other policy instruments, are critical in shaping the electricity generation mix. In light of the need for more advanced tools to support policymakers and other stakeholders in designing, testing, and evaluating long-term markets, this work presents a multi-agent reinforcement learning model capable of capturing the key features of decarbonizing energy systems. Profit-maximizing generation companies make investment decisions in the wholesale electricity market, responding to system needs, competitive dynamics, and policy signals. The model employs independent proximal policy optimization, which was selected for suitability to the decentralized and competitive environment. Nevertheless, given the inherent challenges of independent learning in multi-agent settings, an extensive hyperparameter search ensures that decentralized training yields market outcomes consistent with competitive behavior. The model is applied to a stylized version of the Italian electricity system and tested under varying levels of competition, market designs, and policy scenarios. Results highlight the critical role of market design for decarbonizing the electricity sector and avoiding price volatility. The proposed framework allows assessing long-term electricity markets in which multiple policy and market mechanisms interact simultaneously, with market participants responding and adapting to decarbonization pathways.
翻译:电力系统是推动当今社会向无碳经济转型的关键。长期电力市场机制,包括拍卖、支持计划及其他政策工具,对于塑造发电结构至关重要。鉴于决策者及其他利益相关方在设计、测试和评估长期市场时需要更先进的工具支持,本研究提出一种多智能体强化学习模型,能够捕捉能源系统脱碳进程的关键特征。以利润最大化为目标的发电公司在电力批发市场中做出投资决策,响应系统需求、竞争态势与政策信号。该模型采用独立近端策略优化算法,因其适用于去中心化竞争环境而被选用。然而,考虑到多智能体环境中独立学习的内在挑战,我们通过大规模超参数搜索确保去中心化训练产生的市场结果与竞争行为保持一致。该模型应用于意大利电力系统的简化版本,并在不同竞争程度、市场设计和政策情景下进行测试。结果凸显了市场设计对于电力行业脱碳和避免价格波动的关键作用。所提出的框架能够评估多重政策与市场机制同时交互的长期电力市场,其中市场参与者对脱碳路径做出响应与适应。