In this paper, we analyze the impact of information freshness on supervised learning based forecasting. In these applications, a neural network is trained to predict a time-varying target (e.g., solar power), based on multiple correlated features (e.g., temperature, humidity, and cloud coverage). The features are collected from different data sources and are subject to heterogeneous and time-varying ages. By using an information-theoretic approach, we prove that the minimum training loss is a function of the ages of the features, where the function is not always monotonic. However, if the empirical distribution of the training data is close to the distribution of a Markov chain, then the training loss is approximately a non-decreasing age function. Both the training loss and testing loss depict similar growth patterns as the age increases. An experiment on solar power prediction is conducted to validate our theory. Our theoretical and experimental results suggest that it is beneficial to (i) combine the training data with different age values into a large training dataset and jointly train the forecasting decisions for these age values, and (ii) feed the age value as a part of the input feature to the neural network.
翻译:在本文中,我们分析了信息更新对有监督的基于学习的预测的影响。在这些应用中,神经网络根据多个相关特征(如温度、湿度和云的覆盖范围),根据多个相关特征(如温度、湿度和云层覆盖率),对神经网络进行了培训,以预测时间变化的目标(如太阳能发电)。特征来自不同的数据来源,并受到不同和时间变化年龄的影响。通过使用信息理论方法,我们证明最低培训损失是特性年龄的函数,而功能并不总是单一。但是,如果培训数据的实际分布接近于马尔科夫链的分布,那么培训损失大约是一个非下降年龄值的功能。培训损失和测试都描述了随着年龄增长的类似模式。进行太阳能能力预测实验是为了验证我们的理论。我们的理论和实验结果表明,它有利于(一)将培训数据与不同年龄值合并成一个大型培训数据集,并联合培训这些年龄值的预测决定,以及(二)作为输入神经网络的一个输入特征的一部分,将年龄值纳入。