We describe NorDiaChange, the first diachronic semantic change dataset for Norwegian. NorDiaChange comprises two novel subsets, covering about 80 Norwegian nouns manually annotated with graded semantic change over time. Both subsets follow the same annotation procedure and can be used interchangeably as train and test splits for each other. NorDiaChange covers time periods related to pre- and post-war events, oil and gas discovery in Norway, and technological developments. The annotation was done using the DURel framework and two large historical Norwegian corpora. NorDiaChange is published in full under a permissive licence, complete with raw annotation data and inferred diachronic word usage graphs (DWUGs).