项目名称: 基于WEB信息的信息错误自动检测与修复技术研究
项目编号: No.61502390
项目类型: 青年科学基金项目
立项/批准年度: 2016
项目学科: 自动化技术、计算机技术
项目作者: 刘海龙
作者单位: 西北工业大学
项目金额: 20万元
中文摘要: 信息质量已经成为诸多应用领域面临的一个重要问题,自动检测和修复信息系统中的信息错误是改善信息质量的有效手段。现有的基于规则的信息错误自动检测与修复技术过分依赖数据库中的信息,在相关信息信息量不足时无法确保能够发现有效的信息质量规则和进行准确的信息错误修复。利用WEB进行信息扩展以助于信息错误自动检测与修复可以克服上述不足。本项目将重点关注基于WEB信息的关系型信息错误自动检测与修复技术。拟构建基于WEB信息的信息扩展模型自动扩展关系型信息的信息量,在此基础上提出有效适用的信息错误自动检测与修复算法,基于WEB信息构建合理的评估模型对信息错误自动检测与修复算法的可靠性进行评估。为全面验证所提出的模型及方法的有效性,本项目将设计和开发一个信息错误自动检测与修复原型系统。我们期望本项目的研究能为改善关系数据库信息质量提供一种新的途径。
中文关键词: 信息质量;错误检测;数据修复;WEB
英文摘要: Information quality has become an important issue in many application areas. Automatically detecting and correcting information errors has proven to be an effective way to improve information quality in most information systems. Existing technologies are mostly rule-based and require adequate well-structured data in a database. They are unable to find perfect information quality rules and compelling correcting results when, as is often the case, the available data is insufficient. Introducing web information to help information error detection and correction is an effective way to overcome the shortcomings of existing techniques. This proposal focuses on the web-based technologies for automatic information error detection and correction. We will propose a unified web-based information expansion model to automatically extract additional information from the WEB for relational data, based on which we will present a set of effective information error detection and correction algorithms and a set of web-based reliability evaluation models for our proposed techniques. In order to evaluate the effectiveness of our technologies, we will build a demo system. We hope our research can help people find a new way to enhance the quality of information in their areas.
英文关键词: Information Quality;Error Detection ;Data Repairing;WEB