The rise of process data availability has recently led to the development of data-driven learning approaches. However, most of these approaches restrict the use of the learned model to predict the future of ongoing process executions. The goal of this paper is moving a step forward and leveraging available data to learning to act, by supporting users with recommendations derived from an optimal strategy (measure of performance). We take the optimization perspective of one process actor and we recommend the best activities to execute next, in response to what happens in a complex external environment, where there is no control on exogenous factors. To this aim, we investigate an approach that learns, by means of Reinforcement Learning, the optimal policy from the observation of past executions and recommends the best activities to carry on for optimizing a Key Performance Indicator of interest. The validity of the approach is demonstrated on two scenarios taken from real-life data.
翻译:程序数据提供量的上升最近导致了数据驱动学习方法的形成,然而,大多数这些方法限制了使用所学的模型来预测正在进行的处决过程的未来。本文件的目标是向前迈出一步,利用现有数据来学习行动,支持用户,根据最佳战略(业绩计量)提出建议。我们从一个进程行为者的优化角度出发,建议下一步开展最佳活动,以应对复杂的外部环境中发生的情况,因为外部环境对外部因素没有控制。为此,我们调查一种方法,通过强化学习,从观察过去的处决中学习最佳政策,并建议开展最佳活动,优化利益的关键业绩指标。该方法的有效性体现在从实际数据中得出的两种情景上。