项目名称: 面向Web的中文模糊地名自动识别与近似地理范围估算
项目编号: No.41201405
项目类型: 青年科学基金项目
立项/批准年度: 2013
项目学科: 地理学
项目作者: 陈旭
作者单位: 武汉大学
项目金额: 25万元
中文摘要: 基于人工方式构建中文地名词典,耗时长、地名数目规模受限,无法满足Web环境下地理信息获取服务对地名的需求。本项目研究面向Web的中文模糊地名自动识别与近似地理范围估算,利用面向地名主题信息的协同聚焦爬取方法,从多源海量Web信息中爬取模糊地名及关联地名网页信息,进一步利用规则与统计相结合的模糊中文地名分级识别策略,有效的提取模糊地名及其关联地名信息,最终基于空间扫描统计的方法完成模糊地名近似地理覆盖范围估算。本项目研究Web环境下地名自动获取的新问题,其成果可应用各类网络空间信息系统,具有重要的理论研究价值与应用前景。
中文关键词: 地名;识别;地理范围;;
英文摘要: Building gazetteer by labour is a hard woking ,which is time-consuming and the scale of gazetteer is limited, that can not satisfy the requirement of geographic information retrieval based on Web. So we research on web-based automatic identifying of a chinese vague toponym and the approximate footprint estimate. We use geographically focused collaborative crawling for acquiring web page with chinese vague toponym and associated place names from mulit-source information.Further, we use a hierarchical strategy which a combination of rules and statistics for identifying chinese vague toponym. Finally, spatial scan statistic-based approach is used to estimate the approximate geographic coverage of chinese vague toponym. This project research on the new problem about the obtaining of toponym based on Web, the results can be applied to various WebGIS application, which has important theoretical value and prospects.
英文关键词: toponym;recognition;geographical coverage;;