A lowering in the cost of batteries and solar PV systems has led to a high uptake of solar battery home systems. In this work, we use the deep deterministic policy gradient algorithm to optimise the charging and discharging behaviour of a battery within such a system. Our approach outputs a continuous action space when it charges and discharges the battery, and can function well in a stochastic environment. We show good performance of this algorithm by lowering the expenditure of a single household on electricity to almost \$1AUD for large batteries across selected weeks within a year.
翻译:电池和太阳能光电池系统成本的降低导致太阳能电池主机系统的高吸收率。 在这项工作中,我们使用深度确定性政策梯度算法优化电池在这种系统中的充电和放电行为。我们的方法产生一个连续的操作空间,当电池充电和放电时,并且能够在一个随机的环境中运行良好。我们通过将一个家庭在电力上的开支降低到近1美元来显示这一算法的良好表现,在一年内选定数周内将一个家庭在大型电池上的开支降低到近1美元。