Process mining enables business owners to discover and analyze their actual processes using event data that are widely available in information systems. Event data contain detailed information which is incredibly valuable for providing insights. However, such detailed data often include highly confidential and private information. Thus, concerns of privacy and confidentiality in process mining are becoming increasingly relevant and new techniques are being introduced. To make the techniques easily accessible, new tools need to be developed to integrate the introduced techniques and direct users to appropriate solutions based on their needs. In this paper, we present a Python-based infrastructure implementing and integrating state-of-the-art privacy/confidentiality preservation techniques in process mining. Our tool provides an easy-to-use web-based user interface for privacy-preserving data publishing, risk analysis, and data utility analysis. The tool also provides a set of anonymization operations that can be utilized to support privacy/confidentiality preservation. The tool manages both standard XES event logs and non-standard event data. We also store and manage privacy metadata to track the changes made by privacy/confidentiality preservation techniques.
翻译:开采过程使企业主能够利用信息系统中广泛获得的事件数据发现和分析其实际过程。事件数据包含详细信息,对提供洞察力极有价值。然而,这种详细数据往往包括高度机密和私人信息。因此,对采矿过程中隐私和保密的关切越来越重要,而且正在采用新技术。为了使技术容易获得,需要开发新的工具,以便根据用户的需要,将引进的技术纳入到适当的解决方案中。在本文件中,我们介绍了一个基于Python的基础设施,在采矿过程中实施和整合最新隐私/保密技术。我们的工具为隐私保护数据发布、风险分析和数据效用分析提供了一个方便使用的网络用户界面。该工具还提供一套可以用来支持隐私/保密的匿名作业。该工具管理标准的XES事件日志和非标准事件数据。我们还储存和管理隐私元数据,以跟踪隐私/保密技术的变化。