Skyline computation is an essential database operation with many applications in multi-criteria decision making, such as recommender systems. Existing algorithms focus on point-level domination checks, which are inefficient on large datasets. We propose a grid-based structure that enables domination checks at the level of grid cells. We show that only a small constant number of cells need to be checked, and that this number is independent of the number of data points. Our structure also enables parallel processing. We thus obtain a highly efficient parallel skyline algorithm, named SkyCell, that takes advantage of the parallel processing power of graphics processing units (GPUs). Experimental results confirm the effectiveness and efficiency of SkyCell: it consistently outperforms state-of-the-art algorithms and, in the best cases, by more than two orders of magnitude in computation time.
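To make the contrast between point-level and cell-level domination checks concrete, the following is a minimal sequential sketch in Python. It is not the SkyCell algorithm: the abstract does not specify the grid structure, so everything here is an assumption made for illustration. In particular, it assumes smaller values are better in every dimension, uses a uniform grid with a hypothetical `cell_width` parameter, and treats a cell as dominating another when its index is strictly smaller in every dimension. The names `dominates`, `naive_skyline`, `cell_of`, `cell_dominates`, and `grid_skyline` are likewise hypothetical. The sketch also compares every pair of occupied cells, so it does not reproduce the constant-number-of-cells property or the GPU parallelism claimed for SkyCell.

```python
from typing import Dict, List, Sequence, Tuple

Point = Tuple[float, ...]
Cell = Tuple[int, ...]


def dominates(p: Sequence[float], q: Sequence[float]) -> bool:
    """Point domination (assumption: smaller is better in every dimension).

    p dominates q if p is no worse everywhere and strictly better somewhere.
    """
    return all(a <= b for a, b in zip(p, q)) and any(a < b for a, b in zip(p, q))


def naive_skyline(points: List[Point]) -> List[Point]:
    """Quadratic baseline: keep every point that no other point dominates."""
    return [p for p in points if not any(dominates(q, p) for q in points)]


def cell_of(p: Point, cell_width: float) -> Cell:
    """Map a point to the index of the uniform grid cell that contains it."""
    return tuple(int(x // cell_width) for x in p)


def cell_dominates(c: Cell, d: Cell) -> bool:
    """Illustrative cell domination: c's index is strictly smaller in every
    dimension, so every point in c dominates every point in d."""
    return all(a < b for a, b in zip(c, d))


def grid_skyline(points: List[Point], cell_width: float) -> List[Point]:
    """Prune whole cells first, then run point checks on the survivors."""
    # Group points by the grid cell they fall into.
    cells: Dict[Cell, List[Point]] = {}
    for p in points:
        cells.setdefault(cell_of(p, cell_width), []).append(p)

    # Drop every occupied cell that is dominated by another occupied cell.
    # This sketch compares all pairs of occupied cells for simplicity.
    occupied = list(cells)
    surviving = [d for d in occupied
                 if not any(cell_dominates(c, d) for c in occupied)]

    # Point-level checks are only needed among points in surviving cells.
    candidates = [p for d in surviving for p in cells[d]]
    return naive_skyline(candidates)


if __name__ == "__main__":
    pts = [(1, 9), (2, 2), (9, 1), (8, 8), (5, 5), (3, 7)]
    print(sorted(grid_skyline(pts, cell_width=4)))
```

In this toy input, the cells containing `(8, 8)` and `(5, 5)` are discarded without any point comparison because the cell containing `(2, 2)` dominates them, which is the kind of saving that cell-level checks are meant to provide.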