项目名称: 分布式不确定skyline查询处理关键技术研究
项目编号: No.61472126
项目类型: 面上项目
立项/批准年度: 2015
项目学科: 自动化技术、计算机技术
项目作者: 周炎涛
作者单位: 湖南大学
项目金额: 82万元
中文摘要: 数据呈现出海量性和不确定性,大规模不确定数据的有效管理已成为当前信息学科研究最活跃的领域之一。不确定skyline查询是数据管理中的重要研究技术,在决策支持、数据挖掘和环境监控等方面的应用极为普遍。当前不确定skyline查询的相关研究成果大多面向集中式数据库,对于分布式数据库的研究还比较少。本课题中,我们将研究分布式不确定skyline查询处理关键技术。首先针对不同的数据类型及应用场景,分别提出不确定skyline查询模型和top k不确定skyline查询、不确定k支配skyline查询的新模型;其次,分别设计高效的空间和概率剪枝策略,降低问题的计算复杂度;随后,引入模糊数学、博弈论中的算法设计技术,分别设计以上问题的分布式算法。并根据通信成本及执行时间两大性能指标对算法进行分析。本课题的研究不仅能够为不确定skyline查询算法的设计提供新思路, 还将丰富传统分布式数据处理的研究内容。
中文关键词: skyline查询;并行与分布式计算;不确定数据;大数据集;数据管理
英文摘要: With the new data model such as Social Networking, Location-based services and so on finding their way into our life,data's characters which are uncertain and massive volume have been paied extensive attention.How to manage the abundant and uncertain data effectively has been one of the most hop topics in information science.As an important problem of data management, uncertain skyline query is applied widely in the area of decision support,data mining and environmental monitoring.However, most of the current research fruits about uncertain skyline query are for centralized databases. It is lack of attention on the distributed uncertain skyline query technology. Hence,in this topic we will study the key technologies of distributed uncertain skyline query.Fistsly, we will propose new models for the uncertain skyline query and its two extended problems, top k uncertain skyline query and uncertain k-Dominant skyline query, according to different data types and application scenarios respectively. Secondly, effective spatial and probabilistic pruning strategies will be given to reduct the problem's computational complexity.Then the classical algorithm design strategies of the fuzzy mathmatical and the game theory will be introduced to the distributed algorithms for the problems above respectively. Furthermore, we will analyze the presented algorithms' communication cost and the execution time. It is worth to notice that this study will not only provide new insights into uncertain skyline query, but also enrich the research content of data managemen.
英文关键词: skyline query;parallel and distributed computing;uncertain data;large data set;data management