End-to-end data-driven machine learning methods often have exuberant requirements in terms of quality and quantity of training data which are often impractical to fulfill in real-world applications. This is specifically true in time series domain where problems like disaster prediction, anomaly detection, and demand prediction often do not have a large amount of historical data. Moreover, relying purely on past examples for training can be sub-optimal since in doing so we ignore one very important domain i.e knowledge, which has its own distinct advantages. In this paper, we propose a novel knowledge fusion architecture, Knowledge Enhanced Neural Network (KENN), for time series forecasting that specifically aims towards combining strengths of both knowledge and data domains while mitigating their individual weaknesses. We show that KENN not only reduces data dependency of the overall framework but also improves performance by producing predictions that are better than the ones produced by purely knowledge and data driven domains. We also compare KENN with state-of-the-art forecasting methods and show that predictions produced by KENN are significantly better even when trained on only 50\% of the data.
翻译:终端到终端数据驱动的机器学习方法往往在培训数据的质量和数量方面要求过高,而这些在现实应用中往往不切实际。在诸如灾害预测、异常探测和要求预测等问题往往没有大量历史数据的时间序列领域尤其如此。 此外,单纯依靠过去的培训实例可能不尽理想,因为这样做时我们忽略了一个非常重要的领域,即知识,而知识本身有其独特的优势。在本文中,我们提议建立一个新的知识融合结构,即知识增强神经网络(知识增强神经网络),用于时间序列预测,具体旨在将知识和数据领域的优势结合起来,同时减轻其个别弱点。我们表明,KENN不仅减少了整个框架对数据的依赖性,而且还通过提出比纯知识和数据驱动领域产生的更好预测来改进业绩。我们还将KENN与最先进的预测方法进行比较,并表明,即使仅对数据进行了50个百分点的培训,KENN的预测也大大改进了。