Data mining is a widely used technology for various real-life applications of data analytics and is important to discover valuable association rules in transaction databases. Interesting itemset mining plays an important role in many real-life applications, such as market, e-commerce, finance, and medical treatment. To date, various data mining algorithms based on frequent patterns have been widely studied, but there are a few algorithms that focus on mining infrequent or rare patterns. In some cases, infrequent or rare itemsets and rare association rules also play an important role in real-life applications. In this paper, we introduce a novel fuzzy-based rare itemset mining algorithm called FRI-Miner, which discovers valuable and interesting fuzzy rare itemsets in a quantitative database by applying fuzzy theory with linguistic meaning. Additionally, FRI-Miner utilizes the fuzzy-list structure to store important information and applies several pruning strategies to reduce the search space. The experimental results show that the proposed FRI-Miner algorithm can discover fewer and more interesting itemsets by considering the quantitative value in reality. Moreover, it significantly outperforms state-of-the-art algorithms in terms of effectiveness (w.r.t. different types of derived patterns) and efficiency (w.r.t. running time and memory usage).
翻译:数据开采是一种广泛应用的技术,用于数据分析的各种现实生活中的应用,对于在交易数据库中发现有价值的关联规则十分重要。有趣的物品集开采在许多实际生活中的应用中起着重要作用,例如市场、电子商务、金融和医疗。迄今为止,对基于经常模式的各种数据开采算法进行了广泛研究,但有一些侧重于不常见或罕见的采矿模式的算法。在某些情况下,不常见或稀有的物品和稀有的关联规则在实际生活中的应用中也起着重要作用。在本文中,我们引入了一种新型的、基于fuzzy的稀有物品集开采算法,称为FRI-Miner,它通过应用含语言含义的模糊理论,在数量数据库中发现宝贵和有趣的稀有物品。此外,FRI-Miner利用模糊列表结构储存重要信息,并采用若干调整战略来减少搜索空间。实验结果表明,拟议的FRI-Miner算法可以考虑到现实中的数量价值,发现越来越少、更有趣的物品。此外,它明显超越了一个定量数据库中的宝贵和有趣的稀有稀有的稀有物品。(在时间和时间的模型中) 。