Health literacy has emerged as a crucial factor in making appropriate health decisions and ensuring treatment outcomes. However, medical jargon and the complex structure of professional language in this domain make health information especially hard to interpret. Thus, there is an urgent unmet need for automated methods to enhance the accessibility of the biomedical literature to the general population. This problem can be framed as a type of translation problem between the language of healthcare professionals, and that of the general public. In this paper, we introduce the novel task of automated generation of lay language summaries of biomedical scientific reviews, and construct a dataset to support the development and evaluation of automated methods through which to enhance the accessibility of the biomedical literature. We conduct analyses of the various challenges in solving this task, including not only summarization of the key points but also explanation of background knowledge and simplification of professional language. We experiment with state-of-the-art summarization models as well as several data augmentation techniques, and evaluate their performance using both automated metrics and human assessment. Results indicate that automatically generated summaries produced using contemporary neural architectures can achieve promising quality and readability as compared with reference summaries developed for the lay public by experts (best ROUGE-L of 50.24 and Flesch-Kincaid readability score of 13.30). We also discuss the limitations of the current attempt, providing insights and directions for future work.
翻译:卫生知识普及已成为在做出适当的卫生决定和确保治疗结果方面的一个关键因素,然而,医学术语和该领域专业语言的复杂结构使得健康信息特别难于解释。因此,迫切需要采用自动化方法提高普通民众获得生物医学文献的机会,这个问题可被描述为保健专业人员和一般公众语言之间的一种翻译问题。在本文件中,我们引入了以自动化方式生成生物医学科学审查非专业语言摘要的新任务,并构建了一个数据集,以支持开发和评估提高生物医学文献可获取性的各种自动化方法。我们分析了在完成这项任务时所面临的各种挑战,包括不仅对关键点进行总结,而且对背景知识和专业语言的简化作出解释。我们试验的是最新综合模型以及若干数据增强技术,并使用自动化计量和人文评估来评价其绩效。结果显示,与专家为公众编写的参考摘要相比,自动生成的概要可以实现有希望的质量和可读性。我们还讨论了50.24年期预估前程和30年期预估前程,我们还讨论了50.24年预估前程,并讨论了未来预估力。