与黑洞基准相比的外外科工作流量和技能分析机器学习算数和技能分析的比较比较校验 (Comparative Validation of Machine Learning Algorithms for Surgical Workflow and Skill Analysis with the HeiChole Benchmark)

Martin Wagner,Beat-Peter Müller-Stich,Anna Kisilenko,Duc Tran,Patrick Heger,Lars Mündermann,David M Lubotsky,Benjamin Müller,Tornike Davitashvili,Manuela Capek,Annika Reinke,Tong Yu,Armine Vardazaryan,Chinedu Innocent Nwoye,Nicolas Padoy,Xinyang Liu,Eung-Joo Lee,Constantin Disch,Hans Meine,Tong Xia,Fucang Jia,Satoshi Kondo,Wolfgang Reiter,Yueming Jin,Yonghao Long,Meirui Jiang,Qi Dou,Pheng Ann Heng,Isabell Twick,Kadir Kirtac,Enes Hosgor,Jon Lindström Bolmgren,Michael Stenzel,Björn von Siemens,Hannes G. Kenngott,Felix Nickel,Moritz von Frankenberg,Franziska Mathis-Ullrich,Lena Maier-Hein,Stefanie Speidel,Sebastian Bodenstedt

PURPOSE: Surgical workflow and skill analysis are key technologies for the next generation of cognitive surgical assistance systems. These systems could increase the safety of the operation through context-sensitive warnings and semi-autonomous robotic assistance or improve training of surgeons via data-driven feedback. In surgical workflow analysis up to 91% average precision has been reported for phase recognition on an open data single-center dataset. In this work we investigated the generalizability of phase recognition algorithms in a multi-center setting including more difficult recognition tasks such as surgical action and surgical skill. METHODS: To achieve this goal, a dataset with 33 laparoscopic cholecystectomy videos from three surgical centers with a total operation time of 22 hours was created. Labels included annotation of seven surgical phases with 250 phase transitions, 5514 occurences of four surgical actions, 6980 occurences of 21 surgical instruments from seven instrument categories and 495 skill classifications in five skill dimensions. The dataset was used in the 2019 Endoscopic Vision challenge, sub-challenge for surgical workflow and skill analysis. Here, 12 teams submitted their machine learning algorithms for recognition of phase, action, instrument and/or skill assessment. RESULTS: F1-scores were achieved for phase recognition between 23.9% and 67.7% (n=9 teams), for instrument presence detection between 38.5% and 63.8% (n=8 teams), but for action recognition only between 21.8% and 23.3% (n=5 teams). The average absolute error for skill assessment was 0.78 (n=1 team). CONCLUSION: Surgical workflow and skill analysis are promising technologies to support the surgical team, but are not solved yet, as shown by our comparison of algorithms. This novel benchmark can be used for comparable evaluation and validation of future work.

翻译：外科工作流程和技能分析是下一代认知外科手术协助系统的关键技术。这些系统可以通过对背景敏感的警告和半自动机器人协助或通过数据驱动反馈改善外科医生的培训来提高操作的安全性。在外科工作流程分析中,报告平均精确度高达91%,以便在开放的数据单中数据集中进行阶段识别。在这项工作中,我们调查了在多中心设置中阶段识别算法的通用性,包括更难识别的任务,如手术绝对动作和外科手术技能。方法S:为实现这一目标,三个外科中心的33个大肠杆眼细胞切除术视频集成,总运行时间为22小时。在外科手术流程分析中,共使用了7个外科手术阶段的注释,为5514个手术阶段,为6980个阶段,为7个仪器类别中的21个外科手术工具的确认,但为5个技能层面的495个技能分类。数据集用于2019年的《内科前景》挑战,为手术工作流程和技能分析的分置评估。 12个小组提交了用于20 % 和23 % 的内勤分析,为23 % 。

相关内容

Machine Learning

关注 2241

机器学习（Machine Learning）是一个研究计算学习方法的国际论坛。该杂志发表文章，报告广泛的学习方法应用于各种学习问题的实质性结果。该杂志的特色论文描述研究的问题和方法，应用研究和研究方法的问题。有关学习问题或方法的论文通过实证研究、理论分析或与心理现象的比较提供了坚实的支持。应用论文展示了如何应用学习方法来解决重要的应用问题。研究方法论文改进了机器学习的研究方法。所有的论文都以其他研究人员可以验证或复制的方式描述了支持证据。论文还详细说明了学习的组成部分，并讨论了关于知识表示和性能任务的假设。官网地址：http://dblp.uni-trier.de/db/journals/ml/

【经典书】使用机器学习R语言，149页pdf，Practical Machine Learning in R

专知会员服务

24+阅读 · 2021年1月13日

【健康医疗中的机器学习算法综述】A Survey Of Machine Learning Algorithms In Health Care

专知会员服务

14+阅读 · 2019年11月19日

【Facebook AI】对抗性NLI:自然语言理解的新基准，Adversarial NLI: A New Benchmark for Natural Language Understanding

专知会员服务

11+阅读 · 2019年11月2日

吴恩达新书《Machine Learning Yearning》完整中文版

专知会员服务

147+阅读 · 2019年10月27日