In many real-world applications, sequential rule mining (SRM) can provide prediction and recommendation functions for a variety of services. It is an important technique of pattern mining to discover all valuable rules that belong to high-frequency and high-confidence sequential rules. Although several algorithms of SRM are proposed to solve various practical problems, there are no studies on target sequential rules. Targeted sequential rule mining aims at mining the interesting sequential rules that users focus on, thus avoiding the generation of other invalid and unnecessary rules. This approach can further improve the efficiency of users in analyzing rules and reduce the consumption of data resources. In this paper, we provide the relevant definitions of target sequential rule and formulate the problem of targeted sequential rule mining. Furthermore, we propose an efficient algorithm, called targeted sequential rule mining (TaSRM). Several pruning strategies and an optimization are introduced to improve the efficiency of TaSRM. Finally, a large number of experiments are conducted on different benchmarks, and we analyze the results in terms of their running time, memory consumption, and scalability, as well as query cases with different query rules. It is shown that the novel algorithm TaSRM and its variants can achieve better experimental performance compared to the existing baseline algorithm.
翻译:在许多实际应用中,顺序规则采矿(SRM)可为各种服务提供预测和建议功能,这是模式采矿的一种重要技术,可以发现属于高频和高信任顺序规则的所有有价值的规则。虽然提出了若干次SRM算法以解决各种实际问题,但没有研究目标顺序规则。定向顺序规则采矿的目的是开采用户关注的有趣的顺序规则,从而避免产生其他无效和不必要的规则。这种方法可以进一步提高用户分析规则的效率,减少数据资源的消耗。在本文中,我们提供了目标顺序规则的相关定义,并提出了目标顺序规则采矿问题。此外,我们提出了一种高效算法,称为目标顺序规则采矿(TaSRM)。引入了几项调整战略和优化以提高TASRM的效率。最后,对不同的基准进行了大量实验,我们分析了运行时间、记忆消耗和可缩放性的结果,以及不同查询规则的查询案例。我们发现,新奇的TaSRM及其变量可以比现有基线更好地实现实验性算法。