We introduce a new Slovak masked language model called SlovakBERT in this paper. It is the first Slovak-only transformers-based model trained on a sizeable corpus. We evaluate the model on several NLP tasks and achieve state-of-the-art results. We publish the masked language model, as well as the subsequently fine-tuned models for part-of-speech tagging, sentiment analysis and semantic textual similarity.
翻译:在本文中,我们引入了一个新的斯洛伐克面罩语言模式,名为斯洛伐克语。这是第一个斯洛伐克语的变压器模式,其基础是大量材料。我们评估了斯洛伐克语的变压器模式,并取得了最先进的成果。我们出版了面具语言模式,以及随后对部分语音标记、情绪分析和语义文本相似性进行微调的模型。