Large AI models, or foundation models, are models recently emerging with massive scales both parameter-wise and data-wise, the magnitudes of which often reach beyond billions. Once pretrained, large AI models demonstrate impressive performance in various downstream tasks. A concrete example is the recent debut of ChatGPT, whose capability has compelled people's imagination about the far-reaching influence that large AI models can have and their potential to transform different domains of our life. In health informatics, the advent of large AI models has brought new paradigms for the design of methodologies. The scale of multimodality data in the biomedical and health domain has been ever-expanding especially since the community embraced the era of deep learning, which provides the ground to develop, validate, and advance large AI models for breakthroughs in health-related areas. This article presents an up-to-date comprehensive review of large AI models, from background to their applications. We identify seven key sectors that large AI models are applicable and might have substantial influence, including 1) molecular biology and drug discovery; 2) medical diagnosis and decision-making; 3) medical imaging and vision; 4) medical informatics; 5) medical education; 6) public health; and 7) medical robotics. We examine their challenges in health informatics, followed by a critical discussion about potential future directions and pitfalls of large AI models in transforming the field of health informatics.
翻译:大规模人工智能(AI)模型,或者基础模型,是近期出现的模型,具有巨大的参数和数据规模,其规模往往超过十亿。一旦预训练,大规模AI模型在各种下游任务中展现出惊人的性能。最近推出的ChatGPT就是一个具体例子,其能力引发了人们对大规模AI模型可能产生的深远影响和在不同领域转型的潜力的想象。在健康信息学中,大规模AI模型的出现为方法设计带来了新的范例。生物医学和健康领域中的多模态数据规模一直在扩大,特别是自深度学习时代得以推广以来,这为开发、验证和推进大型AI模型以实现健康相关领域的突破提供了基础。本文提供了关于大型人工智能模型的最新全面综述,从背景到应用。我们确定了大型人工智能模型适用并可能具有重大影响的七个关键领域,包括:1)分子生物学和药物开发;2)医疗诊断和决策;3)医学影像和视觉;4)医学信息学;5)医学教育;6)公共卫生;7)医疗机器人。我们检查了在健康信息学中的挑战,随后批判性地探讨了大规模人工智能模型在改变健康信息学领域中可能出现的未来方向和陷阱。