Co-clustering is a specific type of clustering that addresses the problem of finding groups of objects without necessarily considering all attributes. This technique has shown to have more consistent results in high-dimensional sparse data than traditional clustering. In trajectory co-clustering, the methods found in the literature have two main limitations: first, the space and time dimensions have to be constrained by user-defined thresholds; second, elements (trajectory points) are clustered ignoring the trajectory sequence, assuming that the points are independent among them. To address the limitations above, we propose a new trajectory co-clustering method for mining semantic trajectory co-clusters. It simultaneously clusters the trajectories and their elements taking into account the order in which they appear. This new method uses the element frequency to identify candidate co-clusters. Besides, it uses an objective cost function that automatically drives the co-clustering process, avoiding the need for constraining dimensions. We evaluate the proposed approach using real-world a publicly available dataset. The experimental results show that our proposal finds frequent and meaningful contiguous sequences revealing mobility patterns, thereby the most relevant elements.
翻译:联合集群是一种特定类型的集群,它处理的是在不考虑所有属性的情况下寻找物体群的问题。这一技术显示,在高维分散数据方面比传统集群在传统集群中具有更为一致的结果。在轨迹联合集群中,文献中发现的方法有两个主要局限性:第一,空间和时间层面必须受到用户界定的阈值的限制;第二,元素(轨道点)群集不考虑轨迹序列,假设这些点相互独立。为解决上述局限性,我们提出了一种新的轨迹联合集群方法,用于开采语义轨迹共同集群。它同时将轨迹及其元素分组,同时考虑到它们出现的先后顺序。这一新方法使用元素频率来确定候选组合群;此外,它使用客观的成本功能,自动驱动联合集群进程,避免限制尺寸的需要。我们用现实世界的一个公开数据集来评估拟议的方法。实验结果表明,我们的建议发现频繁和有意义的毗连序列揭示了流动模式,从而发现最相关的元素。