Process discovery aims to learn process models from observed behaviors, i.e., event logs, in the information systems.The discovered models serve as the starting point for process mining techniques that are used to address performance and compliance problems. Compared to the state-of-the-art Inductive Miner, the algorithm applying synthesis rules from the free-choice net theory discovers process models with more flexible (non-block) structures while ensuring the same desirable soundness and free-choiceness properties. Moreover, recent development in this line of work shows that the discovered models have compatible quality. Following the synthesis rules, the algorithm incrementally modifies an existing process model by adding the activities in the event log one at a time. As the applications of rules are highly dependent on the existing model structure, the model quality and computation time are significantly influenced by the order of adding activities. In this paper, we investigate the effect of different ordering strategies on the discovered models (w.r.t. fitness and precision) and the computation time using real-life event data. The results show that the proposed ordering strategy can improve the quality of the resulting process models while requiring less time compared to the ordering strategy solely based on the frequency of activities.
翻译:过程发现的目的是从信息系统中观测到的行为(即事件日志)中学习过程模型。 发现的模型是用于处理性能和合规问题的加工采矿技术的起点。 与最先进的感化矿工相比,应用自由选择网理论综合规则的算法以更灵活的(非区块)结构来发现过程模型,同时确保同样可取的健全性和自由选择性特性。 此外,这一行工作的最新发展表明,所发现的模式具有兼容性。 按照综合规则,算法逐渐改变现有的过程模型,在一次中添加活动记录。由于规则的应用高度依赖现有模式结构,模型质量和计算时间受到增加活动的顺序的极大影响。 在本文件中,我们调查不同排序战略对所发现的模式(w.r.t.健康与准确性)的影响,以及使用现实事件数据的计算时间。结果显示,拟议的定序战略可以提高所产生过程模型的质量,同时比定序活动要求的时间要少得多。