As the number of installed meters in buildings increases, there is a growing number of data time-series that could be used to develop data-driven models to support and optimize building operation. However, building data sets are often characterized by errors and missing values, which are considered, by the recent research, among the main limiting factors on the performance of the proposed models. Motivated by the need to address the problem of missing data in building operation, this work presents a data-driven approach to fill these gaps. In this study, three different autoencoder neural networks are trained to reconstruct missing short-term indoor environment data time-series in a data set collected in an office building in Aachen, Germany. This consisted of a four year-long monitoring campaign in and between the years 2014 and 2017, of 84 different rooms. The models are applicable for different time-series obtained from room automation, such as indoor air temperature, relative humidity and $CO_{2}$ data streams. The results prove that the proposed methods outperform classic numerical approaches and they result in reconstructing the corresponding variables with average RMSEs of 0.42 {\deg}C, 1.30 % and 78.41 ppm, respectively.
翻译:随着建筑中安装的仪表仪表数的增加,越来越多的数据时间序列可用于开发支持和优化建筑运行的数据驱动模型,然而,建筑数据集的特征往往是错误和缺失值,而最近的研究认为,这些错误和缺失值是拟议模型性能的主要限制因素之一。由于需要解决建筑运行中缺失的数据问题,这项工作提出了填补这些空白的数据驱动方法。在这项研究中,三个不同的自动神经网络接受了培训,以重建德国亚琴办公大楼收集的数据集中缺失的短期室内环境数据时间序列,其中包括2014年至2017年期间为期四年的84个不同房间的监测活动。这些模型适用于从室内自动化获得的不同时间序列,如室内空气温度、相对湿度和$CO ⁇ 2}数据流。结果证明,拟议方法优于典型的数字方法,导致对相应的变量进行重建,平均RMEE0.42 exdeg}C、1.30 和78 ppm ppm。