Object-centric process mining is a new paradigm with more realistic assumptions about underlying data by considering several case notions, e.g., an order handling process can be analyzed based on order, item, package, and route case notions. Including many case notions can result in a very complex model. To cope with such complexity, this paper introduces a new approach to cluster similar case notions based on Markov Directly-Follow Multigraph, which is an extended version of the well-known Directly-Follow Graph supported by many industrial and academic process mining tools. This graph is used to calculate a similarity matrix for discovering clusters of similar case notions based on a threshold. A threshold tuning algorithm is also defined to identify sets of different clusters that can be discovered based on different levels of similarity. Thus, the cluster discovery will not rely on merely analysts' assumptions. The approach is implemented and released as a part of a python library, called processmining, and it is evaluated through a Purchase to Pay (P2P) object-centric event log file. Some discovered clusters are evaluated by discovering Directly Follow-Multigraph by flattening the log based on the clusters. The similarity between identified clusters is also evaluated by calculating the similarity between the behavior of the process models discovered for each case notion using inductive miner based on footprints conformance checking.
翻译:以物体为中心的开采过程是一种新范例,它通过考虑几个案例概念,对基础数据有更现实的假设,例如,可以根据顺序、项目、包和路线案例概念分析处理程序,根据顺序、项目、包和路线案例概念进行分析。包括许多案例概念可以导致非常复杂的模式。为了应付这种复杂情况,本文件引入了一种新的方法,根据Markov Directly-Fool-Fool-Multigraph对类似案例概念进行分组。Markov Directly-Fool-Fool-Fool-Formagraphy(这是由许多工业和学术过程采矿工具支持的直接追踪图的扩展版)。这个图表用来计算一个相似的矩阵,用于发现基于临界值的类似案件组群群。一个阈值调算法还用来确定根据不同程度的相似组群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群集。这个叫过程,通过根据所发现的直接跟踪测量定记录群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群群,对。