The real-time prediction of business processes using historical event data is an important capability of modern business process monitoring systems. Existing process prediction methods are able to also exploit the data perspective of recorded events, in addition to the control-flow perspective. However, while well-structured numerical or categorical attributes are considered in many prediction techniques, almost no technique is able to utilize text documents written in natural language, which can hold information critical to the prediction task. In this paper, we illustrate the design, implementation, and evaluation of a novel text-aware process prediction model based on Long Short-Term Memory (LSTM) neural networks and natural language models. The proposed model can take categorical, numerical and textual attributes in event data into account to predict the activity and timestamp of the next event, the outcome, and the cycle time of a running process instance. Experiments show that the text-aware model is able to outperform state-of-the-art process prediction methods on simulated and real-world event logs containing textual data.
翻译:利用历史事件数据实时预测业务流程是现代业务流程监测系统的一个重要能力;现有流程预测方法除了控制-流程角度外,还能够利用记录事件的数据视角;然而,虽然在许多预测技术中考虑到结构完善的数字或绝对属性,但几乎没有任何技术能够利用自然语言编写的文本文件,这些文件能够提供对预测任务至关重要的信息;在本文件中,我们举例说明基于长期短期内存神经网络和自然语言模型的新颖的文本认知进程预测模型的设计、实施和评价;拟议的模型可以将绝对性、数字性和文字属性纳入假设数据,以预测下一个事件的活动和时间印记、结果和运行过程实例的周期。实验显示,文本觉模型能够超越包含文字数据的模拟和现实世界事件日志的状态-工艺预测方法。