Negative sequential pattern mining (SPM) is an important SPM research topic. Unlike positive SPM, negative SPM can discover events that should have occurred but have not occurred, and it can be used for financial risk management and fraud detection. However, existing methods generally ignore the repetitions of the pattern and do not consider gap constraints, which can lead to mining results containing a large number of patterns that users are not interested in. To solve this problem, this paper discovers frequent one-off negative sequential patterns (ONPs). This problem has the following two characteristics. First, the support is calculated under the one-off condition, which means that any character in the sequence can only be used once at most. Second, the gap constraint can be given by the user. To efficiently mine patterns, this paper proposes the ONP-Miner algorithm, which employs depth-first and backtracking strategies to calculate the support. Therefore, ONP-Miner can effectively avoid creating redundant nodes and parent-child relationships. Moreover, to effectively reduce the number of candidate patterns, ONP-Miner uses pattern join and pruning strategies to generate and further prune the candidate patterns, respectively. Experimental results show that ONP-Miner not only improves the mining efficiency, but also has better mining performance than the state-of-the-art algorithms. More importantly, ONP mining can find more interesting patterns in traffic volume data to predict future traffic.
翻译:消极的顺序型采矿(SPM)是一个重要的SPM研究专题。与积极的SPM不同的是,消极的SPM可以发现本应该发生但没有发生的事件,并且可以用于金融风险管理和欺诈检测。然而,现有的方法一般忽视模式的重复,不考虑差距限制,这可能导致采矿结果,包含大量用户不感兴趣的模式。为了解决这个问题,本文件发现经常出现一次性的负面顺序型模式。这个问题有以下两个特点。首先,根据一次性条件计算支持,这意味着序列中的任何字符只能使用一次。第二,用户可以提供差距限制。为了高效地开采模式,本文建议了ONP-Miner算法,它使用深度第一和背轨战略来计算支持。因此,ONP-Miner可以有效地避免产生多余的节点和亲子关系。此外,为了有效减少候选模式的数量,ONP-Min使用模式加入和运行战略来生成和进一步运行该序列中的任何字符串联,这意味着只能最多使用一次。第二,用户可以提供差距限制。为了高效的采矿模式,而实验性结果也只能更准确地显示未来的运行效率。