This paper presents results of a work on crawling CEUR Workshop proceedings web site to a Linked Open Data (LOD) dataset in the framework of ESWC 2014 Semantic Publishing Challenge 2014. Our approach is based on using an extensible template-dependent crawler and DBpedia for linking extracted entities, such as the names of universities and countries.
翻译:本文件介绍了在2014年ESWC语义出版挑战框架内,将CEUR研讨会记录网站与链接开放数据数据集连接起来的工作成果。 我们采用的方法是使用可扩展的基于模板的爬行器和DBpedia连接抽取实体,如大学和国家的名称。