The rapid advance in artificial intelligence technology has facilitated the prosperity of digital humanities research. Against such backdrop, research methods need to be transformed in the intelligent processing of ancient texts, which is a crucial component of digital humanities research, so as to adapt to new development trends in the wave of AIGC. In this study, we propose a GPT model called SikuGPT based on the corpus of Siku Quanshu. The model's performance in tasks such as intralingual translation and text classification exceeds that of other GPT-type models aimed at processing ancient texts. SikuGPT's ability to process traditional Chinese ancient texts can help promote the organization of ancient information and knowledge services, as well as the international dissemination of Chinese ancient culture.
翻译:人工智能技术的快速进步为数字人文研究的繁荣提供了便利。在此背景下,需要转换研究方法,以便在智能处理古代文献方面适应AIGC浪潮的新发展趋势。本研究基于四库全书语料库提出了一种名为SikuGPT的GPT模型。在内语翻译和文本分类等任务中,该模型的性能优于其他旨在处理古代文献的GPT类型模型。SikuGPT处理传统的中国古代文献的优越能力有助于推动古代信息的组织和知识服务,以及中国古代文化的国际传播。