As Smart Meters are collecting and transmitting household energy consumption data to Retail Energy Providers (REP), the main challenge is to ensure the effective use of fine-grained consumer data while ensuring data privacy. In this manuscript, we tackle this challenge for energy load consumption forecasting in regards to REPs which is essential to energy demand management, load switching and infrastructure development. Specifically, we note that existing energy load forecasting is centralized, which are not scalable and most importantly, vulnerable to data privacy threats. Besides, REPs are individual market participants and liable to ensure the privacy of their own customers. To address this issue, we propose a novel horizontal privacy-preserving federated learning framework for REPs energy load forecasting, namely FedREP. We consider a federated learning system consisting of a control centre and multiple retailers by enabling multiple REPs to build a common, robust machine learning model without sharing data, thus addressing critical issues such as data privacy, data security and scalability. For forecasting, we use a state-of-the-art Long Short-Term Memory (LSTM) neural network due to its ability to learn long term sequences of observations and promises of higher accuracy with time-series data while solving the vanishing gradient problem. Finally, we conduct extensive data-driven experiments using a real energy consumption dataset. Experimental results demonstrate that our proposed federated learning framework can achieve sufficient performance in terms of MSE ranging between 0.3 to 0.4 and is relatively similar to that of a centralized approach while preserving privacy and improving scalability.
翻译:随着智能电表收集和传输家庭能耗数据,主要挑战是确保有效利用细粒度的消费数据同时确保数据隐私。在本文中,我们针对能源负荷需求管理、负荷转移和基础设施开发等REP方面的能量负荷预测解决了这个挑战。具体而言,我们注意到现有的能源负荷预测是集中化的,这不可扩展,最重要的是,容易受到数据隐私威胁。此外,REPs是单独的市场参与者,有责任确保其自己客户的隐私。为了解决这个问题,我们提出了一种新颖的水平隐私保护联合学习框架,用于REP能量负荷预测,即FedREP。我们考虑一个联合学习系统,由控制中心和多个零售商组成,通过使多个REPs建立共同的、强大的机器学习模型而不共享数据,因此解决了关键问题,如数据隐私、数据安全和可扩展性。对于预测,我们使用最先进的长短记忆(LSTM)神经网络,因为它能够学习长期观察序列,并承诺使用时间序列数据具有更高的准确性,同时解决梯度消失问题。最后,我们使用真实的能耗数据集进行了广泛的数据驱动实验。实验结果表明,我们提出的联合学习框架可以在保留隐私和提高可扩展性的同时,在MSE方面取得足够的性能,范围在0.3到0.4之间,与集中化方法相对类似。