A cornerstone of the worldwide transition to smart grids are smart meters. Smart meters typically collect and provide energy time series that are vital for various applications, such as grid simulations, fault-detection, load forecasting, load analysis, and load management. Unfortunately, these time series are often characterized by missing values that must be handled before the data can be used. A common approach to handle missing values in time series is imputation. However, existing imputation methods are designed for power time series and do not take into account the total energy of gaps, resulting in jumps or constant shifts when imputing energy time series. In order to overcome these issues, the present paper introduces the new Copy-Paste Imputation (CPI) method for energy time series. The CPI method copies data blocks with similar properties and pastes them into gaps of the time series while preserving the total energy of each gap. The new method is evaluated on a real-world dataset that contains six shares of artificially inserted missing values between 1 and 30%. It outperforms by far the three benchmark imputation methods selected for comparison. The comparison furthermore shows that the CPI method uses matching patterns and preserves the total energy of each gap while requiring only a moderate run-time.
翻译:全世界向智能网格过渡的基石是智能仪表。智能仪表通常收集和提供对各种应用至关重要的能源时间序列,例如电网模拟、故障检测、负载预测、负载分析和负载管理。不幸的是,这些时间序列的特点往往是缺少数值,在数据使用之前必须处理这些数值。处理时间序列缺失值的共同方法是估算。但是,现有的估算方法是为电力时间序列设计的,没有考虑到差距的总能量,导致在计算能源时间序列时跳动或不断移动。为了克服这些问题,本文件介绍了新的能源时间序列的复制-帕斯特光化方法(CPI) 。CPI 方法将具有类似特性的数据块复制成时间序列的空白,同时保存每个差距的总能量。新的方法是在真实世界数据集中进行评估,其中含有人工插入的1至30 %的缺失值的六分之比。它远高于为比较而选择的三种基准参数化参数化方法。比较表明,每使用一种中度能源模型时,CPI方法只使用中度模式。