The extraction, transformation, and loading of event logs from information systems is the first and the most expensive step in process mining. In particular, extracting event logs from popular ERP systems such as SAP poses major challenges, given the size and the structure of the data. Open-source support for ETL is scarce, while commercial process mining vendors maintain connectors to ERP systems supporting ETL of a limited number of business processes in an ad-hoc manner. In this paper, we propose an approach to facilitate event data extraction from SAP ERP systems. In the proposed approach, we store event data in the format of object-centric event logs that efficiently describe executions of business processes supported by ERP systems. To evaluate the feasibility of the proposed approach, we have developed a tool implementing it and conducted case studies with a real-life SAP ERP system.
翻译:从信息系统中提取、转换和装载事件日志是开采过程中第一个也是最昂贵的步骤,特别是,鉴于数据的规模和结构,从SAP等流行的ERP系统提取事件日志构成重大挑战。对ETL的开放源码支持很少,而商业进程采矿供应商以临时方式保持与ERP系统的连接器,支持有限数量的业务流程的ETL。在本文件中,我们提出了便利从SAPERP系统提取事件数据的方法。在拟议办法中,我们以以以以物体为中心的事件日志的形式存储事件数据,有效地描述ERP系统所支持的业务流程的运行情况。为了评估拟议方法的可行性,我们开发了一个实施工具,并用实际的SAPERP系统进行案例研究。