Association rule mining is one of the most studied research fields of data mining, with applications ranging from grocery basket problems to highly explainable classification systems. Classical association rule mining algorithms have several flaws especially with regards to their execution times, memory usage and number of rules produced. An alternative is the use of meta-heuristics, which have been used on several optimisation problems. This paper has two objectives. First, we provide a comparison of the performances of state-of-the-art meta-heuristics on the association rule mining problem. We use the multi-objective versions of those algorithms using support, confidence and cosine. Second, we propose a new algorithm designed to mine rules efficiently from massive datasets by exploring a large variety of solutions, akin to the explosion of species diversity of the Cambrian Explosion. We compare our algorithm to 20 benchmark algorithms on 22 real-world data-sets, and show that our algorithm present good results and outperform several state-of-the-art algorithms.
翻译:协会规则采矿是数据采矿研究最多的研究领域之一,其应用范围从杂货篮子问题到高度可解释的分类系统不等。古老协会规则采矿算法有几个缺陷,特别是在执行时间、记忆使用和所制定的规则数量方面。另一种办法是使用元超常论,用于若干优化问题。本文有两个目标。首先,我们比较了在联合规则采矿问题上最先进的超常论的性能。我们利用支持、信任和 Cosine 来使用这些算法的多目标版本。第二,我们提出一种新的算法,通过探索大量多种解决办法来有效地从大规模数据集中开采规则,类似于Cambrian Exploration的物种多样性爆炸。我们将我们的算法与22个真实世界数据集的20个基准算法进行比较,并表明我们的算法取得了良好的结果,并超越了几个最先进的算法。