To identify the causes of performance problems or to predict process behavior, it is essential to have correct and complete event data. This is particularly important for distributed systems with shared resources, e.g., one case can block another case competing for the same machine, leading to inter-case dependencies in performance. However, due to a variety of reasons, real-life systems often record only a subset of all events taking place. For example, to reduce costs, the number of sensors is minimized or parts of the system are not connected. To understand and analyze the behavior of processes with shared resources, we aim to reconstruct bounds for timestamps of events that must have happened but were not recorded. We present a novel approach that decomposes system runs into entity traces of cases and resources that may need to synchronize in the presence of many-to-many relationships. Such relationships occur, for example, in warehouses where packages for N incoming orders are not handled in a single delivery but in M different deliveries. We use linear programming over entity traces to derive the timestamps of unobserved events in an efficient manner. This helps to complete the event logs and facilitates analysis. We focus on material handling systems like baggage handling systems in airports to illustrate our approach. However, the approach can be applied to other settings where recording is incomplete. The ideas have been implemented in ProM and were evaluated using both synthetic and real-life event logs.
翻译:为了查明性能问题的原因或预测过程行为,必须具备正确和完整的事件数据。这对于分布式系统具有共享资源,这特别重要,例如,一个案件可以阻拦另一个案件竞争同一机器,从而导致在性能方面出现相互依赖的情况。然而,由于各种原因,实际生活系统往往只记录发生的所有事件的一个子集。例如,为了降低成本,传感器的数量被最小化,或者系统的某些部分没有连接。为了理解和分析使用共享资源的过程行为,我们的目标是重建必须发生但未记录的事件的时间标记的界限。我们提出了一个新颖的方法,即拆分系统进入实体案件和资源的痕迹,在存在许多对多种关系的情况下可能需要同步。例如,在仓库里,N进货的包不是一次性交货,而是M型不同交货。我们用线性程序对实体的痕迹进行跟踪,以便以高效的方式获取未观测的事件的时间标记。这有利于完成事件记录过程的进度,从而帮助完成事件记录过程,并便利分析。我们用不完全的系统来记录。我们用不完全的系统来记录。我们用不完全的系统来评估。