项目名称: 海量数据压缩加密存储技术研究
项目编号: No.10876012
项目类型: 面上项目
立项/批准年度: 2009
项目学科: 无线电电子学、电信技术
项目作者: 路松峰
作者单位: 华中科技大学
项目金额: 30万元
中文摘要: 涉密单位需要对海量数据进行压缩加密存储,并需为其中的文本数据提供压缩密态下的安全查询及访问功能。为此,本项目首先提出海量数据压缩加密存储的整体方案;在整体框架的基础上提出了几种基于压缩和加密的索引结构;基于整体框架和索引结构,提出了可以屏蔽原始文档细节的中间XML格式文档结构;研究了适合中文XML文档和文本文档的压缩算法,研究了适合课题环境的支持内积的谓语加密算法;基于索引结构研究了适合压缩数据的检索算法和基于关键字组合的安全密文数据检索算法;同时研究了适合海量数据的文档分片加密和存储策略。基于研究内容构造了一个验证原型系统,并对原型系统进行了工程测试和分析。课题的研究将为压缩加密存储和压缩密文检索研究提供理论基础;此外,课题的研究成果还可直接使用在数字档案馆、数据备份、数据交换、搜索引擎、隐私保持的数据挖掘等领域,具有广阔的应用前景。
中文关键词: 数据压缩;压缩数据检索;密文数据检索;索引结构
英文摘要: The enterprises and institutions, which need to protect their information, require their massive data not only to be compressed and encrypted before its storage but also to secure query and access on text data in its compressed and encrypted format without decompression and decryption to keep their secrets. Therefore, a scheme for the project is presented, and then four index structures supporting compression or encrtption retrieval are proposed.To not consider the input file format, the intermediate XML format is presented. In the project, the compression algorithm for Chinese documents, the predicate encryption algorithm supporting inner product, and the retrieval algorithms for compressed data and encrypted data are presented. File slice encryption and storage model is also researched. Based on the above researches, We developed a prototype system, in the meantime we also analysed and tested the developed system. This project will provide a theoretical basis for the related research on data storage and search under compression and encryption. In addition, the research achievement has broad prospects, which can also be applied directly in the digital archive, data backup, data exchange, search engine, privacy-safe data mining and other fields.
英文关键词: data compression; compressed data retrieval; ciphertext data retrieval; indexed structure