In biomedical natural language processing, named entity recognition (NER) and named entity normalization (NEN) are key tasks that enable the automatic extraction of biomedical entities (e.g., diseases and chemicals) from the ever-growing biomedical literature. In this paper, we present BERN2 (Advanced Biomedical Entity Recognition and Normalization), a tool that improves the previous neural network-based NER tool (Kim et al., 2019) by employing a multi-task NER model and neural network-based NEN models to achieve much faster and more accurate inference. We hope that our tool can help annotate large-scale biomedical texts more accurately for various tasks such as biomedical knowledge graph construction.
翻译:在生物医学自然语言处理中,名称实体识别(NER)和名称实体正常化(NEN)是使生物医学实体(如疾病和化学品)自动从不断增长的生物医学文献中提取的关键任务,在本文件中,我们介绍了BERN2(高级生物医学实体识别和正常化)这一工具,它通过使用多任务NER模型和神经网络NEN模型,改进了以前以神经网络为基础的NER工具(Kim等人,2019年),以便实现更快和更准确的推论。 我们希望,我们的工具能够帮助更准确地为生物医学知识图的构建等各项任务批发大规模生物医学文本。