项目名称: 不确定数据流的分布并行Skyline查询技术研究
项目编号: No.61502511
项目类型: 青年科学基金项目
立项/批准年度: 2016
项目学科: 自动化技术、计算机技术
项目作者: 李小勇
作者单位: 中国人民解放军国防科技大学
项目金额: 20万元
中文摘要: 不确定数据流的Skyline查询作为大数据分析的一个重要方面,在环境监控、决策制定和数据挖掘等应用中发挥着重要作用。由于查询需求的不断扩大及其在计算过程和数据流更新方面的复杂性,已有的集中式查询方法难以满足现实应用对查询效率的需求。当前诸如数据中心等分布式计算环境的兴起和广泛运用,为实现不确定数据流的分布并行Skyline查询处理提供了有利条件。本项目拟针对不确定数据流开展分布并行Skyline查询技术的研究工作,重点围绕其在查询模型、高效性、容错性和灵活性四个方面的研究挑战展开研究。具体研究内容包括:弹性自适应的分布并行查询模型、基于三级优化的分布并行查询方法、基于复制的容错分布并行查询方法、支持并行区间树刺探查询的n-of-N概率Skyline查询方法。本项目旨在建立一套高效实现不确定数据流分布并行Skyline查询的理论与机制,提升我国在相关领域的研究水平和自主创新能力。
中文关键词: 分布并行查询;不确定数据流;轮廓查询;容错查询;并行模型
英文摘要: The skyline query over uncertain data streams, as an important aspect of big data analysis, plays an important role in fields like environment monitoring, decision-making and data mining. The expanding requirements and its complex in the computing process and streaming updating, make existing centralized methods be hard to meet the requirements of the query efficiency in real applications. The advance and widely used of current distributed computing environments like data centers, provides favorable conditions for realizing distributed parallel skyline queries over uncertain data streams. This project will carry out the work of distributed parallel skyline query technology for uncertain data streams, and primarily focus on addressing the problems of the query mode, efficiency, fault-tolerance and flexibility for the parallel queries. The specific contents of this research include the elastic and adaptive distributed parallel query model, the three-level optimization based distributed parallel skyline query approach, the replication based fault-tolerant distributed parallel skyline query approach, and the parallel interval stabbing query oriented n-of-N probabilistic skyline query approach. This project targets to establish a set of theories and mechanisms for realizing efficient distributed parallel skyline queries over uncertain data streams, in order to promote the research level and the independent innovational ability of our country in the related areas.
英文关键词: Distributed Parallel Queries;Uncertain Data Streams;Skyline Queries;Fault-Tolerant Queries;Parallel Model