Machine learning is rapidly being used in database research to improve the effectiveness of numerous tasks included but not limited to query optimization, workload scheduling, physical design, etc. Currently, the research focus has been on replacing a single database component responsible for one task by its learning-based counterpart. However, query performance is not simply determined by the performance of a single component, but by the cooperation of multiple ones. As such, learned based database components need to collaborate during both training and execution in order to develop policies that meet end performance goals. Thus, the paper attempts to address the question "Is it possible to design a database consisting of various learned components that cooperatively work to improve end-to-end query latency?". To answer this question, we introduce MADB (Multi-Agent DB), a proof-of-concept system that incorporates a learned query scheduler and a learned query optimizer. MADB leverages a cooperative multi-agent reinforcement learning approach that allows the two components to exchange the context of their decisions with each other and collaboratively work towards reducing the query latency. Preliminary results demonstrate that MADB can outperform the non-cooperative integration of learned components.
翻译:在数据库研究中迅速使用机器学习,以提高包括但不局限于查询优化、工作量安排、物理设计等在内的许多任务的效力。目前,研究重点是更换一个单一的数据库组成部分,由学习对应方负责一项任务。然而,查询的绩效并非简单地由单一组成部分的绩效决定,而是由多个组成部分的合作决定。因此,以学习为基础的数据库组成部分需要在培训和执行期间进行合作,以便制定达到最终业绩目标的政策。因此,文件试图解决“是否有可能设计一个由各种学习组成部分组成的数据库,由合作改进端到端的查询延迟度?”的问题。为了回答这个问题,我们引入了MADB(多边-代理DB),这是一个概念验证系统,其中包括一个学习的查询时间表和一个学习的查询优化器。MADB利用一个合作的多工具强化学习方法,使两个组成部分能够相互交流其决定的背景,并合作减少查询延迟。初步结果显示,MDADB可以超越所学组成部分的不合作的整合。