Process mining aims to provide insights into the actual processes based on event data. These data are often recorded by information systems and are widely available. However, they often contain sensitive private information that should be analyzed responsibly. Therefore, privacy issues in process mining are recently receiving more attention. Privacy preservation techniques obviously need to modify the original data, yet, at the same time, they are supposed to preserve the data utility. Privacy-preserving transformations of the data may lead to incorrect or misleading analysis results. Hence, new infrastructures need to be designed for publishing the privacy-aware event data whose aim is to provide metadata regarding the privacy-related transformations on event data without revealing details of privacy preservation techniques or the protected information. In this paper, we provide formal definitions for the main anonymization operations, used by privacy models in process mining. These are used to create an infrastructure for recording the privacy metadata. We advocate the proposed privacy metadata in practice by designing a privacy extension for the XES standard and a general data structure for event data which are not in the form of standard event logs.
翻译:采矿过程的目的是根据事件数据提供对实际过程的洞察力,这些数据往往由信息系统记录,并且广泛提供,但往往含有应负责任地分析的敏感私人信息,因此,采矿过程中的隐私问题最近受到更多的注意。隐私保护技术显然需要修改原始数据,但与此同时,它们应该维护数据效用。数据的隐私保护转换可能导致不正确或误导分析结果。因此,需要设计新的基础设施,公布隐私意识事件数据,目的是提供事件数据与隐私有关的转换元数据,而没有披露隐私保护技术或受保护信息的细节。在本文件中,我们为隐私模型在采矿过程中使用的主要匿名作业提供了正式定义。这些定义用于为记录隐私元数据建立一个基础设施。我们在实践中倡导拟议的隐私元数据,为XES标准设计隐私扩展,并为非标准事件记录形式的事件数据设计一个一般数据结构。