Reliable remaining time prediction of ongoing business processes is a highly relevant topic. One example is order delivery, a key competitive factor in e.g. retailing as it is a main driver of customer satisfaction. For realising timely delivery, an accurate prediction of the remaining time of the delivery process is crucial. Within the field of process mining, a wide variety of remaining time prediction techniques have already been proposed. In this work, we extend remaining time prediction based on stochastic Petri nets with generally distributed transitions with k-nearest neighbors. The k-nearest neighbors algorithm is performed on simple vectors storing the time passed to complete previous activities. By only taking a subset of instances, a more representative and stable stochastic Petri Net is obtained, leading to more accurate time predictions. We discuss the technique and its basic implementation in Python and use different real world data sets to evaluate the predictive power of our extension. These experiments show clear advantages in combining both techniques with regard to predictive power.
翻译:对进行中的业务流程的可靠剩余时间预测是一个高度相关的专题。一个例子是订单交付,这是零售业中的一个关键竞争因素,因为它是客户满意度的主要驱动力。为了实现及时交付,准确预测交付过程的剩余时间至关重要。在流程采矿领域,已经提出了大量剩余时间预测技术。在这项工作中,我们扩大了基于随机彼得里网的剩余时间预测,与最近的邻居普遍进行过渡。K-最近邻算法是在储存过去完成活动所花时间的简单矢量上进行的。通过只收集一系列实例,获得一个更具有代表性和稳定性的Stochatic Petri Net,从而导致更准确的时间预测。我们在Python讨论该技术及其基本应用,并使用不同的真实世界数据集来评价我们扩展的预测能力。这些实验在将这两种技术与预测能力相结合方面显示出明显的优势。