The paper presents an open-domain Question Answering system for Romanian, answering COVID-19 related questions. The QA system pipeline involves automatic question processing, automatic query generation, web searching for the top 10 most relevant documents and answer extraction using a fine-tuned BERT model for Extractive QA, trained on a COVID-19 data set that we have manually created. The paper will present the QA system and its integration with the Romanian language technologies portal RELATE, the COVID-19 data set and different evaluations of the QA performance.
翻译:该文件介绍了罗马尼亚的开放域问答系统,回答了COVID-19相关问题,质量保证系统管道包括自动处理问题、自动生成查询、网上搜索前10个最相关的文件,以及使用经精细调整的BERT " 提取质量评估 " 模型进行回答提取,该模型是我们手动创建的一套COVID-19数据集的培训,将介绍质量保证系统及其与罗马尼亚语言技术门户网站RELATE、COVID-19数据集和对质量评估的不同评价。