项目名称: 短文本的精确语义感知与多分类研究
项目编号: No.61300094
项目类型: 青年科学基金项目
立项/批准年度: 2014
项目学科: 自动化技术、计算机技术
项目作者: 周尔强
作者单位: 电子科技大学
项目金额: 25万元
中文摘要: 对文本的语义研究是自然语言处理以及人工智能的难点与热点问题。本项目首次将"精确语义推理"作为研究重点,从语义、精确推理以及多分类三个方面对短文本感知展开全面的研究。主要研究内容包括:①语义研究,即从词汇语义学、认知语言学、认知学等语言学领域以及计算机领域的知识工程,本体知识库等入手,研究探索从哪个方面或者用哪种方法可以更好地将词义(即词汇所蕴含的知识)用一种形式化或计算机化的方法表示出来;②精确推理研究,即研究在什么条件下、应该用或者不应该知识库中的哪些知识,以及研究如何使用语义知识、各种语义知识在使用时是否有共性、有哪些共性等;③多分类研究:即在语义以及精确推理机制的支持下,研究如何将文本按照不同的维度,即文本的社会属性,如:主题类别、传播等级、情感极性以及政治敏感性等,和不同的层次,如篇张、段落和句子,进行分类。
中文关键词: 语义;知识库;短文本;多分类;
英文摘要: The research of text semantics is a difficult but hot issue in the field of Natural Language Processing and Artificial Intelligence. This proposal is one of the first attempts where semantic and programmatic information of texts is incorporated so that the goal of precise reasoning according to these information can be undertaken. Three aspects (semantics, precise reasoning and multi-classification) will be addressed in this proposal: 1) The research of semantics will summarize the state-of-the-art in the field of linguistics, including lexical semantics, cognitive linguistics, cognitive science, and the field of computer knowledge engineering as well as ontology research. The task of semantics research will investigate and answer the question that on which aspects or by what methods can the meaning of words(the knowledge hiding behind) be better expressed in a formal or computerized way. 2) The research of precise reasoning will study on what condition a certain piece of knowledge in the knowledge base should be (or not be) used, and answer whether different semantic rules have some properties in common. 3) The research of text multi-classification will investigate the "social attributes" of texts and attack the problem of how to classify texts from different dimensions (such as: topic labels, propagation abili
英文关键词: Semantics;Knowledge Base;Short Texts;Multi-Classification;