Open procedures represent the dominant form of surgery worldwide. Artificial intelligence (AI) has the potential to optimize surgical practice and improve patient outcomes, but efforts have focused primarily on minimally invasive techniques. Our work overcomes existing data limitations for training AI models by curating, from YouTube, the largest dataset of open surgical videos to date: 1997 videos from 23 surgical procedures uploaded from 50 countries. Using this dataset, we developed a multi-task AI model capable of real-time understanding of surgical behaviors, hands, and tools - the building blocks of procedural flow and surgeon skill. We show that our model generalizes across diverse surgery types and environments. Illustrating this generalizability, we directly applied our YouTube-trained model to analyze open surgeries prospectively collected at an academic medical center and identified kinematic descriptors of surgical skill related to efficiency of hand motion. Our Annotated Videos of Open Surgery (AVOS) dataset and trained model will be made available for further development of surgical AI.
翻译:人工智能(AI)具有优化外科手术和改进病人结果的潜力,但工作主要侧重于最低侵入性技术。我们的工作克服了培训人工智能模型的现有数据限制,从YouTube中整理了迄今最大的开放外科视频数据集:1997年50个国家上传的23个外科手术程序的视频。我们利用这一数据集开发了一个多任务AI模型,能够实时了解外科手术行为、手和工具——程序流动和外科手术技能的构件。我们展示了我们的模型,将各种外科手术类型和环境都归纳为通用的。我们用这种概括性来直接应用了我们的YouTube培训模型,以分析预期在学术医疗中心收集的开放外科手术,并确定了与手动效率有关的外科手术技能的动态描述器。我们将提供开放外科手术(AVOS)的附加录像和经过培训的模型,以便进一步开发外科人工智能。