Reinforcement learning has recently been used to enhance index structures, giving rise to reinforcement learning-enhanced spatial indices (RLESIs) that aim to improve query efficiency during index construction. However, their practical benefits remain unclear due to the lack of unified implementations and comprehensive evaluations, especially in disk-based settings. We present the first modular and extensible benchmark for RLESIs. Built on top of an existing spatial index library, our framework decouples index training from building, supports parameter tuning, and enables consistent comparison with traditional, advanced, and learned spatial indices. We evaluate 12 representative spatial indices across six datasets and diverse workloads, including point, range, kNN, spatial join, and mixed read/write queries. Using latency, I/O, and index statistics as metrics, we find that while RLESIs can reduce query latency with tuning, they consistently underperform learned spatial indices and advanced variants in both query efficiency and index build cost. These findings highlight that although RLESIs offer promising architectural compatibility, their high tuning costs and limited generalization hinder practical adoption.
翻译:强化学习近期被用于增强索引结构,催生了旨在提升索引构建过程中查询效率的强化学习增强型空间索引。然而,由于缺乏统一的实现与全面的评估,尤其是在基于磁盘的环境中,其实际效益尚不明确。我们提出了首个模块化且可扩展的强化学习增强型空间索引基准测试框架。该框架基于现有空间索引库构建,将索引训练与构建过程解耦,支持参数调优,并能与传统、先进及学习型空间索引进行一致性比较。我们在六个数据集及多样化工作负载(包括点查询、范围查询、k近邻查询、空间连接查询以及混合读写查询)下评估了12种代表性空间索引。以延迟、I/O及索引统计量为衡量指标,研究发现:尽管强化学习增强型空间索引可通过调优降低查询延迟,但在查询效率与索引构建成本方面均持续逊色于学习型空间索引及其先进变体。这些结果表明,虽然强化学习增强型空间索引具备良好的架构兼容性前景,但其高昂的调优成本与有限的泛化能力阻碍了实际应用。