项目名称: 基于微博社区的知识图谱构建与分析
项目编号: No.61472329
项目类型: 面上项目
立项/批准年度: 2015
项目学科: 其他
项目作者: 杜亚军
作者单位: 西华大学
项目金额: 82万元
中文摘要: 传统搜索引擎需要用户从返回网页中提炼有用知识;社交网络搜索利用人物的社会关系、共同爱好,提供人物和兴趣间的关系等方面的搜索结果。当前,社交网络搜索主要存在两个问题:第一,不能从语义上理解用户查询词;第二,仅局限于人物、兴趣搜索,限制了查询范围。另一方面,微博已成为社交网络的重要平台,为解决微博搜索中这两个问题和主动返回更多知识,本项目研究微博社区的知识图谱构建与分析,重点研究:微博社区中概念提取,概念包括人物、事物、地点、事件和话题等5种类型;微博社区概念间的关系提取,关系包括上述五种概念间的组合关系;知识图谱是带有语义的网络图谱,其将概念作为顶点并将概念间关系作为边,研究图谱的构建方法;微博社区知识图谱分析,包括构建效果、演化特征、应用效果分析;研发基于微博知识图谱的应用系统。预期获得微博社区知识图谱构建及应用的新思想、新方法、新技术、新系统。项目研究具有重要的理论意义和广阔的应用前景。
中文关键词: 网络信息检索;数据挖掘;搜索引擎;社会网络
英文摘要: Search engine only returns the Web page set for the user queries, it needs the user refine useful knowledge from it; Social Network Search (SNS) directly provides people and their interest to users by using characters' social relations and common hobbies. However, the SNS mainly exists two unresolved problems. On the one hand, the SNS can't semantically understand user queries submitted by users. On the other hand, the SNS only provides people search and interest search, and confines query domains for users. Microblog has become an important platform for social network. To address these problems of information retrieval about microblog and provide more knowledge for user queries, this project researches knowledge graph construction and analysis based on the microblog community. The project focuses on five contents: (1)It researches concept extractions for the microblog community, and concepts have five types including people, things, locations, events and topics; (2)It researches relationships extractions for the microblog community. The relationships among concepts include collection types formed by combining two arbitrary types above concepts; (3)It researches knowledge graph construction, and the knowledge graph is a semantic network graph which takes concepts and relationships respectively as vertices and edges; (4)It researches knowledge graph analysis. It includes construction effect analysis, evolution characteristics and rules analysis and application effect analysis. (5)It researches the application interface and system based the knowledge graph. By researching, we expect to acquire a series of new ideas, new methods, new technologies and new systems with constructing the graph knowledge based on the microblog community for its information retrieval. In IR fields, this research project has important significances in theories and broad prospects in applications.
英文关键词: Web Information Retrieval;Data Mining;Search Engine;social Network