Process mining bridges the gap between process management and data science by discovering process models using event logs derived from real-world data. Besides mandatory event attributes, additional attributes can be part of an event representing domain data, such as human resources and costs. Data-enhanced process models provide a visualization of domain data associated to process activities directly in the process model, allowing to monitor the actual values of domain data in the form of event attribute aggregations. However, event logs can have so many attributes that it is difficult to decide, which one is of interest to observe throughout the process. This paper introduces three mechanisms to support domain data selection, allowing process analysts and domain experts to progressively reach their information of interest. We applied the proposed technique on the MIMIC-IV real-world data set on hospitalizations in the US.
翻译:利用来自真实世界数据的事件日志发现过程模型以发现过程管理与数据科学之间的差距。除了强制性事件属性外,额外的属性也可以是代表域数据的活动的一部分,如人力资源和成本。数据增强过程模型提供了直接与过程模型中进程活动相关的域数据的可视化,从而能够以事件属性聚合的形式监测域数据的实际值。然而,事件日志可能有如此多的属性,难以确定,在整个过程中,哪些是值得观察的。本文介绍了支持域数据选择的三个机制,使过程分析员和域专家能够逐步获得他们感兴趣的信息。我们应用了拟议的MIMIC-IV实际世界数据集关于美国住院情况的技术。