In the process of collecting data from sensors, several circumstances can affect their continuity and validity, resulting in alterations of the data or loss of information. Although classical methods of statistics, such as interpolation-like techniques, can be used to approximate the missing data in a time series, the recent developments in Deep Learning (DL) have given impetus to innovative and much more accurate forecasting techniques. In the present paper, we develop two DL models aimed at filling data gaps, for the specific case of internal temperature time series obtained from monitored apartments located in Bolzano, Italy. The DL models developed in the present work are based on the combination of Convolutional Neural Networks (CNNs), Long Short-Term Memory Neural Networks (LSTMs), and Bidirectional LSTMs (BiLSTMs). Two key features of our models are the use of both pre- and post-gap data, and the exploitation of a correlated time series (the external temperature) in order to predict the target one (the internal temperature). Our approach manages to capture the fluctuating nature of the data and shows good accuracy in reconstructing the target time series. In addition, our models significantly improve the already good results from another DL architecture that is used as a baseline for the present work.
翻译:在从传感器收集数据的过程中,若干情况可能影响其连续性和有效性,导致数据改变或信息丢失。虽然典型的统计方法,如内推式技术,可以用来在时间序列中估计缺失的数据,但深入学习(DL)的最近发展推动了创新和更准确的预报技术。在本文件中,我们开发了两个DL模型,旨在填补数据缺口,以填补从意大利博尔扎诺的受监测公寓获得的内部温度时间序列的具体案例。在目前工作中开发的DL模型以动态神经网络(CNNs)、长期短期内存神经网络(LSTMs)和双向内存LSTMS(BILSTMs)的组合为基础。我们模型的两个关键特征是使用前和后数据,以及利用一个相关时间序列(外部温度)来预测目标1(内部温度)。我们用来收集数据波动性质并显示目标时间序列重建中良好准确性的方法。此外,我们所使用的另一个模型已经大大改进了当前基准结构的另一种模型。