Association Rule Mining is a machine learning method for discovering the interesting relations between the attributes in a huge transaction database. Typically, algorithms for Association Rule Mining generate a huge number of association rules, from which it is hard to extract structured knowledge and present this automatically in a form that would be suitable for the user. Recently, an information cartography has been proposed for creating structured summaries of information and visualizing with methodology called "metro maps". This was applied to several problem domains, where pattern mining was necessary. The aim of this study is to develop a method for automatic creation of metro maps of information obtained by Association Rule Mining and, thus, spread its applicability to the other machine learning methods. Although the proposed method consists of multiple steps, its core presents metro map construction that is defined in the study as an optimization problem, which is solved using an evolutionary algorithm. Finally, this was applied to four well-known UCI Machine Learning datasets and one sport dataset. Visualizing the resulted metro maps not only justifies that this is a suitable tool for presenting structured knowledge hidden in data, but also that they can tell stories to users.
翻译:规则采矿协会是一个在庞大的交易数据库中发现各种属性之间令人感兴趣的关系的机械学习方法。 通常, 规则采矿协会的算法产生大量的联合规则, 很难从中提取结构化知识, 并且以适合用户的方式自动提出。 最近, 提出了一个信息制图建议, 以“ 气象地图” 的方法来创建结构化的信息摘要和可视化。 这个方法被应用于几个问题领域, 需要模式采矿。 本研究的目的是开发一种方法, 自动绘制由协会规则采矿公司获得的信息的地铁地图, 从而将其推广到其他机器学习方法。 虽然拟议的方法由多个步骤组成, 但其核心显示地铁地图的构造在研究中被定义为优化问题, 使用进化算法加以解决 。 最后, 这应用于四个众所周知的 UCI 机器学习数据集和一个运动数据集。 将结果的地铁地图进行视觉化不仅说明这是向用户讲述数据中隐藏的结构化知识的适当工具, 而且还可以向用户讲述故事。