High utility itemset mining approaches discover hidden patterns from large amounts of temporal data. However, an inescapable problem of high utility itemset mining is that its discovered results hide the quantities of patterns, which causes poor interpretability. The results only reflect the shopping trends of customers, which cannot help decision makers quantify collected information. In linguistic terms, computers use mathematical or programming languages that are precisely formalized, but the language used by humans is always ambiguous. In this paper, we propose a novel one-phase temporal fuzzy utility itemset mining approach called TFUM. It revises temporal fuzzy-lists to maintain less but major information about potential high temporal fuzzy utility itemsets in memory, and then discovers a complete set of real interesting patterns in a short time. In particular, the remaining measure is the first adopted in the temporal fuzzy utility itemset mining domain in this paper. The remaining maximal temporal fuzzy utility is a tighter and stronger upper bound than that of previous studies adopted. Hence, it plays an important role in pruning the search space in TFUM. Finally, we also evaluate the efficiency and effectiveness of TFUM on various datasets. Extensive experimental results indicate that TFUM outperforms the state-of-the-art algorithms in terms of runtime cost, memory usage, and scalability. In addition, experiments prove that the remaining measure can significantly prune unnecessary candidates during mining.
翻译:高效用项目设定采矿方法从大量时间数据中发现了隐藏的隐蔽模式。然而,高效用项目设定采矿的不可避免问题是,其发现的结果隐藏了大量模式,造成解释不力。结果仅反映客户的购物趋势,无法帮助决策者量化所收集的信息。在语言方面,计算机使用数学或编程语言,这些语言非常正规化,但人类使用的语言总是模糊不清。在本文件中,我们提议了一个名为TFUM的新颖的一阶段时间模糊的不必要用途项目设置采矿方法。它修改时间模糊清单,以在记忆中保存较少但重要的潜在高时空模糊的公用事业项目信息,然后在短时间内发现一套完整的真正有趣的模式。特别是,剩下的措施是本文中在时间模糊的公用事业项目设置采矿领域首次采用的语言,但人类使用的语言总是模糊不清。剩下的最高级时间模糊的效用比以往研究采用的更紧、更严格。因此,它在TFUMUM的搜索空间运行过程中发挥了重要作用。最后,我们还评估了采矿实验周期期间的各种实验结果。