Process discovery aims to discover models that can explain the behaviors of event logs extracted from information systems. While various approaches have been proposed, only a few guarantee desirable properties such as soundness and free-choice. State-of-the-art approaches that exploit the representational bias of process trees to provide the guarantees are constrained to be block-structured.Such constructs limit the expressive power of the discovered models, i.e., only a subset of sound free-choice workflow nets can be discovered. To support a more flexible structural representation, we aim to discover process models that provide the same guarantees but also allow for non-block structures. Inspired by existing works that utilize synthesis rules from the free-choice nets theory, we propose an automatic approach that incrementally adds activities to an existing process model with predefined patterns. Playing by the rules ensures that the resulting models are always sound and free-choice. Furthermore, the discovered models are not restricted to block structures and are thus more flexible. The approach has been implemented in Python and tested using various real-life event logs. The experiments show that our approach can indeed discover models with competitive quality and more flexible structures compared to the existing approach.
翻译:过程发现的目的是发现能够解释从信息系统中提取的事件日志行为的模型。虽然提出了各种办法,但只有少数几种保证可取的特性,如稳妥性和自由选择。利用流程树代表偏差提供保障的先进方法受到限制,只能形成块状结构。这种结构限制了所发现模型的表达力,即,只能发现一组健全的自由选择工作流程网。为了支持更灵活的结构代表性,我们的目标是发现提供相同保障但也允许非区块结构的流程模型。在使用自由选择网理论综合规则的现有工作启发下,我们提出了一种自动方法,将活动逐步地添加到具有预先确定模式的现有流程模型中。运用这些规则可以确保所产生的模型始终是稳妥和自由的。此外,所发现的模式并不局限于屏蔽结构,因此更加灵活。该方法已在Python实施,并使用各种真实事件日志进行了测试。实验表明,我们的方法确实能够发现具有竞争性的质量和比现有结构更灵活的模型。