基于互联网的汉维科技术语提取技术研究 - 专知基金

会员服务 ·

0

可比语料库 · 双语对齐 · 汉语-维吾尔语 ·

2014 年 12 月 31 日

基于互联网的汉维科技术语提取技术研究

国家自然科学基金

国家自然科学基金委员会

项目名称： 基于互联网的汉维科技术语提取技术研究

项目编号： No.61463048

项目类型： 地区科学基金项目

立项/批准年度： 2015

项目学科： 其他

项目作者： 米尔夏提·力提甫

作者单位： 新疆大学

项目金额： 45万元

中文摘要： 术语(terms)集中承载特定领域的核心知识，术语自动抽取能够帮助人们便捷地获得和认识领域知识，而双语术语则充分体现了语言间的映射和对应关系，在自然语言处理中具有重要地位。本项在目前期预研的基础上，构建面向科技领域的汉维可比语料库，研究实用的基于可比语料的汉维双语术语抽取方法、汉维双语语料自动获取方法、维汉语料篇章级自动对齐方法,基于规则的维吾尔语术语识别以及抽取混合方法,研制基于互联网语料的汉维双语术语抽取原型系统，构建面向科技领域的汉语-维吾尔语双语新术语资源库，抽取和编纂科技领域的汉语-维吾尔语双语对齐新术语词典为汉维机器翻译、跨语言信息检索提供支持，促进新疆科技事业的发展和信息化建设进程。

中文关键词： 术语；可比语料库；双语对齐；汉语-维吾尔语

英文摘要： The concentration of terms carries the core knowledge of a particular field. Automatically extraction of terms can help people to access and understand the field of knowledge in a convenient and fast way. More over, bilingual terminology fully reflects the mapping and corresponding relations between the languages, and it plays an important role in the natural language processing. In this project, on the basis of pre-research, we will build science and technology-oriented Chinese-Uyghur comparable corpus to study practical method of comparable corpus based Chinese-Uyghur bilingual term extraction, method of Chinese-Uyghur Automatic corpus extraction, method of Chinese-Uyghur article level automatic alignment and hybrid approach of rule based Uyghur term detection and extraction. Develop Internet based Chinese-Uyghur extraction prototype system, build new term repository, extract and compile science and technology oriented Chinese-Uyghur bilingual new term dictionary to support Chinese-Uyghur machine translation, cross language information retrieval and advance the development of science , technology and information construction of Xinjiang.

英文关键词： Terminology;Comparable Corpus;Bilingual Alignment;Chinese-Uyghur

成为VIP会员查看完整内容

0

相关内容

可比语料库

可比语料库

知识图谱研究现状及军事应用

知识图谱研究现状及军事应用

专知会员服务

199+阅读 · 2022年4月8日

军事知识图谱构建技术

军事知识图谱构建技术

专知会员服务

140+阅读 · 2022年4月8日

《金融大数据术语》行业标准，24页pdf

《金融大数据术语》行业标准，24页pdf

专知会员服务

55+阅读 · 2022年2月28日

重磅！国家标准《信息技术人工智能知识图谱技术框架》征求意见稿发布，35页pdf详细规定知识图谱技术框架

重磅！国家标准《信息技术人工智能知识图谱技术框架》征求意见稿发布，35页pdf详细规定知识图谱技术框架

专知会员服务

256+阅读 · 2022年2月19日

央行发布《金融大数据术语》，25页pdf

央行发布《金融大数据术语》，25页pdf

专知会员服务

43+阅读 · 2022年1月25日

面向网络空间安全情报的知识图谱综述

专知会员服务

117+阅读 · 2021年1月8日

企业风险知识图谱的构建及应用

企业风险知识图谱的构建及应用

专知会员服务

98+阅读 · 2020年11月6日

面向知识图谱的信息抽取

专知会员服务

202+阅读 · 2020年10月14日

基于迁移学习的细粒度实体分类方法的研究

专知会员服务

32+阅读 · 2020年9月2日

中文知识图谱构建技术以及应用的综述

中文知识图谱构建技术以及应用的综述

专知会员服务

317+阅读 · 2019年10月19日

《金融大数据术语》行业标准，24页pdf

《金融大数据术语》行业标准，24页pdf

专知

1+阅读 · 2022年2月28日

重磅！国家标准《信息技术人工智能知识图谱技术框架》征求意见稿发布，35页pdf详细规定知识图谱技术框架

重磅！国家标准《信息技术人工智能知识图谱技术框架》征求意见稿发布，35页pdf详细规定知识图谱技术框架

新智元

3+阅读 · 2022年2月20日

央行发布《金融大数据术语》标准，25页pdf

央行发布《金融大数据术语》标准，25页pdf

专知

0+阅读 · 2022年1月25日

OpenKG开源系列 | 大规模中文概念图谱OpenConcepts (浙江大学)

OpenKG开源系列 | 大规模中文概念图谱OpenConcepts (浙江大学)

开放知识图谱

1+阅读 · 2021年7月15日

面向新闻媒体的命名实体识别技术

面向新闻媒体的命名实体识别技术

PaperWeekly

18+阅读 · 2019年4月17日

连载 | 知识图谱发展报告 2018 -- 前言

连载 | 知识图谱发展报告 2018 -- 前言

开放知识图谱

18+阅读 · 2018年10月7日

【知识图谱】一个有效的知识图谱是如何构建的？

【知识图谱】一个有效的知识图谱是如何构建的？

产业智能官

57+阅读 · 2018年4月5日

领域应用 | 中医临床知识图谱的构建与应用

领域应用 | 中医临床知识图谱的构建与应用

开放知识图谱

33+阅读 · 2017年12月12日

综述 | 知识图谱发展概述

综述 | 知识图谱发展概述

PaperWeekly

76+阅读 · 2017年11月3日

漆桂林 | 知识图谱之语义网络篇

漆桂林 | 知识图谱之语义网络篇

开放知识图谱

19+阅读 · 2017年8月12日

基于EHR结构模型和DCM的医学术语协同化方法研究

国家自然科学基金

4+阅读 · 2014年12月31日

中文句子语义概念图自动构建方法及应用研究

国家自然科学基金

3+阅读 · 2014年12月31日

基于互联网海量信息的数据库文本类型数据清洗研究

国家自然科学基金

1+阅读 · 2013年12月31日

面向科技监测的实体识别与关系抽取研究

国家自然科学基金

3+阅读 · 2013年12月31日

基于FrameNet的中文评价词汇本体构建与观点挖掘研究

国家自然科学基金

1+阅读 · 2013年12月31日

中文领域本体学习及半自动构建方法研究

国家自然科学基金

3+阅读 · 2012年12月31日

互联网环境下中文实体知识挖掘关键技术研究

国家自然科学基金

3+阅读 · 2012年12月31日

基于数据驱动的中文自然语言生成关键技术研究

国家自然科学基金

7+阅读 · 2012年12月31日

基于本体的多策略民汉机器翻译研究

国家自然科学基金

0+阅读 · 2011年12月31日

基于类格的多层网页分类技术研究

国家自然科学基金

0+阅读 · 2008年12月31日

Event Transition Planning for Open-ended Text Generation

Arxiv

0+阅读 · 2022年4月20日

Representation of short distances in structurally sparse graphs

Arxiv

0+阅读 · 2022年4月19日

The signature and cusp geometry of hyperbolic knots

Arxiv

0+阅读 · 2022年4月19日

Mixture of Experts for Biomedical Question Answering

Arxiv

0+阅读 · 2022年4月15日

On Scheduling Mechanisms Beyond the Worst Case

Arxiv

0+阅读 · 2022年4月14日

Multi-Modal Knowledge Graph Construction and Application: A Survey

Arxiv

79+阅读 · 2022年2月11日

Relating Graph Neural Networks to Structural Causal Models

Arxiv

44+阅读 · 2021年9月9日

Multi-view Graph Contrastive Representation Learning for Drug-Drug Interaction Prediction

Arxiv

26+阅读 · 2020年12月29日

Image-to-Image Retrieval by Learning Similarity between Scene Graphs

Arxiv

21+阅读 · 2020年12月29日

From Knowledge Graph Embedding to Ontology Embedding: Region Based Representations of Relational Structures

Arxiv

10+阅读 · 2018年5月26日

阅读: 0 点赞: 0

小贴士

登录享主题订阅及个性化推荐

相关主题

可比语料库

汉语-维吾尔语

热门VIP内容

开通专知VIP会员享更多权益服务

【斯坦福博士论文】数据、决策与过度依赖：构建可信人工智能的核心挑战

《多域时代中维持弹性军事训练：挑战与机遇》

【AAAI2026】专家数量何为最优？面向混合专家模型的语义专业化优化研究

自进化人工智能体的全面综述：连接基础模型与终身自主智能系统的新范式

相关VIP内容

知识图谱研究现状及军事应用

知识图谱研究现状及军事应用

专知会员服务

199+阅读 · 2022年4月8日

军事知识图谱构建技术

军事知识图谱构建技术

专知会员服务

140+阅读 · 2022年4月8日

《金融大数据术语》行业标准，24页pdf

《金融大数据术语》行业标准，24页pdf

专知会员服务

55+阅读 · 2022年2月28日

重磅！国家标准《信息技术人工智能知识图谱技术框架》征求意见稿发布，35页pdf详细规定知识图谱技术框架

重磅！国家标准《信息技术人工智能知识图谱技术框架》征求意见稿发布，35页pdf详细规定知识图谱技术框架

专知会员服务

256+阅读 · 2022年2月19日

央行发布《金融大数据术语》，25页pdf

央行发布《金融大数据术语》，25页pdf

专知会员服务

43+阅读 · 2022年1月25日

面向网络空间安全情报的知识图谱综述

专知会员服务

117+阅读 · 2021年1月8日

企业风险知识图谱的构建及应用

企业风险知识图谱的构建及应用

专知会员服务

98+阅读 · 2020年11月6日

面向知识图谱的信息抽取

专知会员服务

202+阅读 · 2020年10月14日

基于迁移学习的细粒度实体分类方法的研究

专知会员服务

32+阅读 · 2020年9月2日

中文知识图谱构建技术以及应用的综述

中文知识图谱构建技术以及应用的综述

专知会员服务

317+阅读 · 2019年10月19日

相关资讯

《金融大数据术语》行业标准，24页pdf

《金融大数据术语》行业标准，24页pdf

专知

1+阅读 · 2022年2月28日

重磅！国家标准《信息技术人工智能知识图谱技术框架》征求意见稿发布，35页pdf详细规定知识图谱技术框架

重磅！国家标准《信息技术人工智能知识图谱技术框架》征求意见稿发布，35页pdf详细规定知识图谱技术框架

新智元

3+阅读 · 2022年2月20日

央行发布《金融大数据术语》标准，25页pdf

央行发布《金融大数据术语》标准，25页pdf

专知

0+阅读 · 2022年1月25日

OpenKG开源系列 | 大规模中文概念图谱OpenConcepts (浙江大学)

OpenKG开源系列 | 大规模中文概念图谱OpenConcepts (浙江大学)

开放知识图谱

1+阅读 · 2021年7月15日

面向新闻媒体的命名实体识别技术

面向新闻媒体的命名实体识别技术

PaperWeekly

18+阅读 · 2019年4月17日

连载 | 知识图谱发展报告 2018 -- 前言

连载 | 知识图谱发展报告 2018 -- 前言

开放知识图谱

18+阅读 · 2018年10月7日

【知识图谱】一个有效的知识图谱是如何构建的？

【知识图谱】一个有效的知识图谱是如何构建的？

产业智能官

57+阅读 · 2018年4月5日

领域应用 | 中医临床知识图谱的构建与应用

领域应用 | 中医临床知识图谱的构建与应用

开放知识图谱

33+阅读 · 2017年12月12日

综述 | 知识图谱发展概述

综述 | 知识图谱发展概述

PaperWeekly

76+阅读 · 2017年11月3日

漆桂林 | 知识图谱之语义网络篇

漆桂林 | 知识图谱之语义网络篇

开放知识图谱

19+阅读 · 2017年8月12日

相关基金

基于EHR结构模型和DCM的医学术语协同化方法研究

国家自然科学基金

4+阅读 · 2014年12月31日

中文句子语义概念图自动构建方法及应用研究

国家自然科学基金

3+阅读 · 2014年12月31日

基于互联网海量信息的数据库文本类型数据清洗研究

国家自然科学基金

1+阅读 · 2013年12月31日

面向科技监测的实体识别与关系抽取研究

国家自然科学基金

3+阅读 · 2013年12月31日

基于FrameNet的中文评价词汇本体构建与观点挖掘研究

国家自然科学基金

1+阅读 · 2013年12月31日

中文领域本体学习及半自动构建方法研究

国家自然科学基金

3+阅读 · 2012年12月31日

互联网环境下中文实体知识挖掘关键技术研究

国家自然科学基金

3+阅读 · 2012年12月31日

基于数据驱动的中文自然语言生成关键技术研究

国家自然科学基金

7+阅读 · 2012年12月31日

基于本体的多策略民汉机器翻译研究

国家自然科学基金

0+阅读 · 2011年12月31日

基于类格的多层网页分类技术研究

国家自然科学基金

0+阅读 · 2008年12月31日

相关论文

Event Transition Planning for Open-ended Text Generation

Arxiv

0+阅读 · 2022年4月20日

Representation of short distances in structurally sparse graphs

Arxiv

0+阅读 · 2022年4月19日

The signature and cusp geometry of hyperbolic knots

Arxiv

0+阅读 · 2022年4月19日

Mixture of Experts for Biomedical Question Answering

Arxiv

0+阅读 · 2022年4月15日

On Scheduling Mechanisms Beyond the Worst Case

Arxiv

0+阅读 · 2022年4月14日

Multi-Modal Knowledge Graph Construction and Application: A Survey

Arxiv

79+阅读 · 2022年2月11日

Relating Graph Neural Networks to Structural Causal Models

Arxiv

44+阅读 · 2021年9月9日

Multi-view Graph Contrastive Representation Learning for Drug-Drug Interaction Prediction

Arxiv

26+阅读 · 2020年12月29日

Image-to-Image Retrieval by Learning Similarity between Scene Graphs

Arxiv

21+阅读 · 2020年12月29日

From Knowledge Graph Embedding to Ontology Embedding: Region Based Representations of Relational Structures

Arxiv

10+阅读 · 2018年5月26日

微信扫码咨询专知VIP会员