The relationship between electricity demand and weather has been established for a long time and is one of the cornerstones in load prediction for operation and planning, along with behavioral and social aspects such as calendars or significant events. This paper explores how and why the social information contained in the news can be used better to understand aggregate population behaviour in terms of energy demand. The work is done through experiments analysing the impact of predicting features extracted from national news on day-ahead electric demand prediction. The results are compared to a benchmark model trained exclusively on the calendar and meteorological information. Experimental results showed that the best-performing model reduced the official standard errors around 4%, 11%, and 10% in terms of RMSE, MAE, and SMAPE. The best-performing methods are: word frequency identified COVID-19-related keywords; topic distribution that identified news on the pandemic and internal politics; global word embeddings that identified news about international conflicts. This study brings a new perspective to traditional electricity demand analysis and confirms the feasibility of improving its predictions with unstructured information contained in texts, with potential consequences in sociology and economics.
翻译:长期以来,电力需求与天气之间的关系已经确立,是运行和规划负荷预测的基石之一,同时也是日历或重大事件等行为和社会方面的基石之一。本文探讨了如何和为什么可以更好地利用新闻中所包含的社会信息,从能源需求方面更好地了解总人口的行为。这项工作是通过实验来分析从国家新闻中提取的日头电需求预测中得出的特征的预测影响完成的。结果与完全在日历和气象信息上培训的基准模型进行了比较。实验结果表明,最佳模型减少了官方标准错误,在RUSE、MAE和SMAPE方面大约4%、11%和10%。最有效的方法是:文字频率确定与COVID-19相关的关键词;主题发布确定了有关流行病和国内政治的新闻;全球语言嵌入了有关国际冲突的新闻。这项研究为传统的电力需求分析带来了新的视角,并证实了用文本中未结构化的信息改进预测的可行性,并可能给社会学和经济带来潜在后果。