While the advanced machine learning algorithms are effective in load forecasting, they often suffer from low data utilization, and hence their superior performance relies on massive datasets. This motivates us to design machine learning algorithms with improved data utilization. Specifically, we consider the load forecasting for a new user in the system by observing only few shots (data points) of its energy consumption. This task is challenging since the limited samples are insufficient to exploit the temporal characteristics, essential for load forecasting. Nonetheless, we notice that there are not too many temporal characteristics for residential loads due to the limited kinds of human lifestyle. Hence, we propose to utilize the historical load profile data from existing users to conduct effective clustering, which mitigates the challenges brought by the limited samples. Specifically, we first design a feature extraction clustering method for categorizing historical data. Then, inheriting the prior knowledge from the clustering results, we propose a two-phase Long Short Term Memory (LSTM) model to conduct load forecasting for new users. The proposed method outperforms the traditional LSTM model, especially when the training sample size fails to cover a whole period (i.e., 24 hours in our task). Extensive case studies on two real-world datasets and one synthetic dataset verify the effectiveness and efficiency of our method.
翻译:虽然先进的机器学习算法在载荷预测方面是有效的,但它们往往受到低数据利用的影响,因此其优异性能依赖于大量数据集。这促使我们设计机器学习算法,改进数据利用。具体地说,我们考虑系统新用户的负荷预测,只观察其能源消耗的几个镜头(数据点)。这项任务具有挑战性,因为有限的样品不足以利用时间特性,而对于载荷预测来说是必不可少的。然而,我们注意到,由于人类生活方式的种类有限,住宅负荷没有太多的时间特性。因此,我们提议利用现有用户的历史负载剖析数据进行有效的集成,以减轻有限样品带来的各种挑战。具体地说,我们首先设计一种特征提取组集成方法,对历史数据进行分类。然后,从集成结果中继承先前的知识,我们提出一个两阶段长的短期内存(LSTM)模型,为新用户进行负载预报。拟议的方法比传统的LSTM模型要差得多,特别是当培训样品大小无法覆盖整个时期(即我们任务中的24小时)时。我们先用它来核查一个合成数据的效率。