Scholarly publications are key to the transfer of knowledge from scholars to others. However, research papers are information-dense, and as the volume of the scientific literature grows, so does the need for new technology to support the reading process. In contrast to the process of finding papers, which has been transformed by Internet technology, the experience of reading research papers has changed little in decades. The PDF format for sharing research papers is widely used due to its portability, but it has significant downsides, including static content, poor accessibility for low-vision readers, and difficulty reading on mobile devices. This paper explores the question "Can recent advances in AI and HCI power intelligent, interactive, and accessible reading interfaces -- even for legacy PDFs?" We describe the Semantic Reader Project, a collaborative effort across multiple institutions to explore automatic creation of dynamic reading interfaces for research papers. Through this project, we've developed ten research prototype interfaces and conducted usability studies with more than 300 participants and real-world users, which showed improved reading experiences for scholars. We've also released a production reading interface for research papers that will incorporate the best features as they mature. We structure this paper around challenges scholars and the public face when reading research papers -- Discovery, Efficiency, Comprehension, Synthesis, and Accessibility -- and present an overview of our progress and remaining open challenges.