A groundswell of recent work has focused on improving data management systems with learned components. Specifically, work on learned index structures has proposed replacing traditional index structures, such as B-trees, with learned models. Given the decades of research committed to improving index structures, there is significant skepticism about whether learned indexes actually outperform state-of-the-art implementations of traditional structures on real-world data. To answer this question, we propose a new benchmarking framework that comes with a variety of real-world datasets and baseline implementations to compare against. We also show preliminary results for selected index structures, and find that learned models indeed often outperform state-of-the-art implementations, and are therefore a promising direction for future research.
翻译:最近的工作大有进展,重点是改进数据管理系统,包括学习内容。具体地说,关于学习指数结构的工作建议用学习模式取代传统的指数结构,如B-树。鉴于致力于改善指数结构的数十年研究,人们对于学习指数是否实际上优于执行关于现实世界数据的传统结构的先进程度存在重大怀疑。为了回答这个问题,我们提出了一个新的基准框架,其中包含各种真实世界数据集和基准执行,以便进行比较。我们还展示了选定指数结构的初步结果,并发现学习模式确实确实比最新执行标准要好,因此是未来研究的一个有希望的方向。