Project Title: Research on the Processing of Xixia Script and Xixia Documents in a Networked Environment
Project Number: No.60803104
Project Type: Young Scientists Fund project
Year of Approval: 2009
Project Discipline: Metallurgy and Metalworking Technology
Project Author: 柳长青 (Liu Changqing)
Author Affiliation: Ningxia University (宁夏大学)
Project Funding: 190,000 yuan
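Chinese Abstract (translated): As Xixia studies continue to deepen, digitizing Xixia script and Xixia documents and converting them to machine-readable text in a networked environment, together with support for querying and retrieving the resulting material, is of considerable importance. This project uses computing to study solutions and implementation techniques for the digital curation of Xixia documents, with the final goal of building a digital resource platform for Xixia documents. The platform supports displaying Xixia script in web pages, converting Xixia documents to text, precisely locating Xixia keywords within the digitized document resources, and rapidly updating and publishing the font library and document resources. The research also explores computational methods for studying the ancient texts of ethnic minorities, as well as research methods that combine computer science and technology with the humanities and social sciences. Building on existing work, the project first established a Xixia ancient-book font library. The glyphs in this library come entirely from Xixia ancient documents, each one taken from a published Xixia document, so they faithfully reflect the essential characteristics of Xixia characters, and their overall structure has not been artificially beautified or retouched. It is currently the only Xixia font library produced strictly according to the original documents. The project also discusses the structural features of Xixia glyphs and compares them with Chinese characters. An intelligent four-corner-code input method for Xixia script greatly increased the speed of Xixia text entry. Computer graphics and image processing techniques were used to preprocess and segment images from 《俄藏黑水城西夏文献》 (the Russian collection of Xixia documents from Heishuicheng). Xixia documents were converted to text by combining manual work with computer processing. Finally, a Xixia electronic dictionary was implemented as an example application of the Xixia document database.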
Chinese Keywords (translated): Xixia documents; digitization; Xixia script; Xixia ancient-book font library; Xixia database
English Abstract: As Xixia studies have developed over the past decades, the digitization of Xixia characters and documents, together with text-based indexing and retrieval of the literature, has become highly significant. This project aims to digitize Xixia documents and to construct a digital resource platform for Xixia literature. The platform supports displaying Xixia script in web pages, converting Xixia documents to text, precisely locating Xixia keywords within the digitized resources, and quickly updating and publishing the font and document resources. Through this work the project explores computational methods for studying the ancient books and documents of ethnic minorities, and research methods that combine computer science and technology with the humanities and social sciences. Building on existing work, the project first established a Xixia ancient font. Its glyphs are taken entirely from published Xixia literature, so they truly reflect the essential characteristics of Xixia characters, and their overall structure has not been artificially beautified or modified. It is currently the only Xixia font produced in strict accordance with the original literature. The project also discusses the structural characteristics of Xixia glyphs and compares them with Chinese characters. An intelligent four-corner input method for Xixia characters greatly improves the entry speed of Xixia text. Computer graphics and image processing techniques were used to preprocess and segment images from the Russian Collection of Heishuicheng Xixia literature, and Xixia documents were converted to text through a combination of manual and computer processing. Finally, a Xixia electronic dictionary was implemented as an application example of the Xixia historical documents database.
English Keywords: Xixia historical documents; digitalization; Xixia characters; Xixia ancient font; Xixia database