Predictive business process monitoring is concerned with the prediction how a running process instance will unfold up to its completion at runtime. Most of the proposed approaches rely on a wide number of different machine learning (ML) techniques. In the last years numerous comparative studies, reviews, and benchmarks of such approaches where published and revealed that they can be successfully applied for different prediction targets. ML techniques require a qualitatively and quantitatively sufficient data set. However, there are many situations in business process management (BPM) where only a quantitatively insufficient data set is available. The problem of insufficient data in the context of BPM is still neglected. Hence, none of the comparative studies or benchmarks investigates the performance of predictive business process monitoring techniques in environments with small data sets. In this paper an evaluation framework for comparing existing approaches with regard to their suitability for small data sets is developed and exemplarily applied to state-of-the-art approaches in predictive business process monitoring.
翻译:预测性的业务流程监测涉及预测一个运行过程实例将如何在运行时完成的预测,大多数拟议办法依赖多种不同的机器学习技术。在过去几年中,对公布和显示可以成功应用于不同预测目标的这类方法进行了许多比较研究、审查和基准,这些研究、审查和基准表明,这些方法可以成功地应用于不同的预测目标。ML技术需要一套质量和数量上足够的数据集。然而,在业务流程管理中有许多情况,只有数量上不足的数据集。在业务流程管理中,数据不足的问题仍然被忽视。因此,没有一项比较研究或基准调查在使用小数据集的环境中预测业务流程监测技术的性能。在本文件中,开发了一个评价框架,用以比较现有方法是否适合小型数据集,并示范用于预测业务流程监测方面的最新方法。