Numerous process discovery techniques exist for generating process models that describe recorded executions of business processes. The models are meant to generalize executions into human-understandable modeling patterns, notably parallelism, and enable rigorous analysis of process deviations. However, well-defined models with parallelism returned by existing techniques are often too complex or generalize the recorded behavior too strongly to be trusted in a practical business context. We bridge this gap by introducing the Probabilistic Inductive Miner (PIM) based on the Inductive Miner framework. PIM compares in each step the most probable operators and structures based on frequency information in the data, which results in block-structured models with significantly higher accuracy. All design choices in PIM are based on business context requirements obtained through a user study with industrial process mining experts. PIM is evaluated quantitatively and in an novel kind of empirical study comparing users' trust in discovered model structures. The evaluations show that PIM strikes a unique trade-off between model accuracy and model complexity, that is conclusively preferred by users over all state-of-the-art process discovery methods.
翻译:有许多过程发现技术可用于生成描述有记录的业务流程执行过程的流程模型。这些模型旨在将处决普遍化为人类可理解的模型模式,特别是平行模式,并能够对过程偏差进行严格分析。然而,现有技术所恢复的明确界定的平行模式往往过于复杂或过于笼统,无法在实际商业环境中信任记录的行为。我们通过采用基于诱导采矿框架的概率诱导采矿器(PIM)来弥补这一差距。PIM在每一步中根据数据中的频率信息对最可能操作者和结构进行对比,从而导致形成条块状结构模型。PIM的所有设计选择都基于通过与工业过程采矿专家的用户研究获得的商业背景要求。PIM在数量上和实验性研究中进行了评价,比较用户对所发现的模型结构的信任。评价表明,PIM在模型准确性和模型复杂度之间发生了一种独特的交易,用户确实选择了所有最先进的过程发现方法。