Nowadays, graph becomes an increasingly popular model in many real applications. The efficiency of graph storage is crucial for these applications. Generally speaking, the tune tasks of graph storage rely on the database administrators (DBAs) to find the best graph storage. However, DBAs make the tune decisions by mainly relying on their experiences and intuition. Due to the limitations of DBAs's experiences, the tunes may have an uncertain performance and conduct worse efficiency. In this paper, we observe that an estimator of graph workload has the potential to guarantee the performance of tune operations. Unfortunately, because of the complex characteristics of graph evaluation task, there exists no mature estimator for graph workload. We formulate the evaluation task of graph workload as a classification task and carefully design the feature engineering process, including graph data features, graph workload features and graph storage features. Considering the complex features of graph and the huge time consumption in graph workload execution, it is difficult for the graph workload estimator to obtain enough training set. So, we propose an active auto-estimator (AAE) for the graph workload evaluation by combining the active learning and deep learning. AAE could achieve good evaluation efficiency with limited training set. We test the time efficiency and evaluation accuracy of AAE with two open source graph data, LDBC and Freebase. Experimental results show that our estimator could efficiently complete the graph workload evaluation in milliseconds.
翻译:目前,图表在许多实际应用中成为一个越来越受欢迎的模型。图形存储的效率对于这些应用至关重要。一般来说,图形存储的调控任务依靠数据库管理员(DBAs)找到最佳的图表存储。然而,DBA主要依靠他们的经验和直觉来做出调控决定。由于DBA经验的局限性,这些调调可能具有不确定的性能,而且效率更差。在本文中,我们观察到,图表工作量估算员有潜力保证调控操作的运行。不幸的是,由于图表评估任务的复杂特点,没有成熟的图表工作量估算员。我们把图表工作量评估任务作为分类任务,并仔细设计特征工程流程,包括图表数据特征、图表工作量特征和图表存储特征。考虑到图表的复杂特点以及图表工作量执行过程中的大量时间消耗,因此,图表工作量估算员很难获得足够的培训。因此,我们提议为图表工作量评估设立一个积极的自动估算员(AE),将积极学习和深度学习的图表工作量计算结果结合起来。我们把图表工作量评估任务设计成一个成熟的图表,AEDLA 和两个测试源的精度的精度,我们可以实现ASTA和A的精度的实验室的精度的精度评估。