In modern business processes, the amount of data collected has increased substantially in recent years. Because this data can potentially yield valuable insights, automated knowledge extraction based on process mining has been proposed, among other techniques, to provide users with intuitive access to the information contained therein. At present, the majority of technologies aim to reconstruct explicit business process models. These are directly interpretable but limited concerning the integration of diverse and real-valued information sources. On the other hand, Machine Learning (ML) benefits from the vast amount of data available and can deal with high-dimensional sources, yet it has rarely been applied to being used in processes. In this contribution, we evaluate the capability of modern Transformer architectures as well as more classical ML technologies of modeling process regularities, as can be quantitatively evaluated by their prediction capability. In addition, we demonstrate the capability of attentional properties and feature relevance determination by highlighting features that are crucial to the processes' predictive abilities. We demonstrate the efficacy of our approach using five benchmark datasets and show that the ML models are capable of predicting critical outcomes and that the attention mechanisms or XAI components offer new insights into the underlying processes.
翻译:近年来,在现代业务流程中,收集的数据数量大幅增加,因为这些数据有可能产生宝贵的洞察力,因此,除其他技术外,还提出了基于工艺开采的自动知识提取方法,以便为用户提供直观的获取其中所含信息的途径。目前,大多数技术旨在重建明确的业务流程模型,这些在整合多样化和有实际价值的信息源方面直接可以解释,但数量有限。另一方面,机器学习(ML)从大量现有数据中受益,可以处理高维来源,但很少用于流程中。我们评估了现代变异器结构的能力以及更经典的ML程序常规化技术,可以通过预测能力进行定量评估。此外,我们通过突出对流程预测能力至关重要的特征,展示了关注特性和特征相关性确定的能力。我们用五个基准数据集展示了我们的方法的有效性,并表明ML模型能够预测关键结果,关注机制或XAI组件为基本流程提供了新的洞察力。