PURPOSE: Surgical workflow and skill analysis are key technologies for the next generation of cognitive surgical assistance systems. These systems could increase the safety of the operation through context-sensitive warnings and semi-autonomous robotic assistance or improve training of surgeons via data-driven feedback. In surgical workflow analysis up to 91% average precision has been reported for phase recognition on an open data single-center dataset. In this work we investigated the generalizability of phase recognition algorithms in a multi-center setting including more difficult recognition tasks such as surgical action and surgical skill. METHODS: To achieve this goal, a dataset with 33 laparoscopic cholecystectomy videos from three surgical centers with a total operation time of 22 hours was created. Labels included annotation of seven surgical phases with 250 phase transitions, 5514 occurences of four surgical actions, 6980 occurences of 21 surgical instruments from seven instrument categories and 495 skill classifications in five skill dimensions. The dataset was used in the 2019 Endoscopic Vision challenge, sub-challenge for surgical workflow and skill analysis. Here, 12 teams submitted their machine learning algorithms for recognition of phase, action, instrument and/or skill assessment. RESULTS: F1-scores were achieved for phase recognition between 23.9% and 67.7% (n=9 teams), for instrument presence detection between 38.5% and 63.8% (n=8 teams), but for action recognition only between 21.8% and 23.3% (n=5 teams). The average absolute error for skill assessment was 0.78 (n=1 team). CONCLUSION: Surgical workflow and skill analysis are promising technologies to support the surgical team, but are not solved yet, as shown by our comparison of algorithms. This novel benchmark can be used for comparable evaluation and validation of future work.
翻译:外科工作流程和技能分析是下一代认知外科手术协助系统的关键技术。 这些系统可以通过对背景敏感的警告和半自动机器人协助或通过数据驱动反馈改善外科医生的培训来提高操作的安全性。 在外科工作流程分析中,报告平均精确度高达91%,以便在开放的数据单中数据集中进行阶段识别。 在这项工作中,我们调查了在多中心设置中阶段识别算法的通用性,包括更难识别的任务,如手术绝对动作和外科手术技能。 方法S:为实现这一目标,三个外科中心的33个大肠杆眼细胞切除术视频集成,总运行时间为22小时。 在外科手术流程分析中,共使用了7个外科手术阶段的注释,为5514个手术阶段,为6980个阶段,为7个仪器类别中的21个外科手术工具的确认,但为5个技能层面的495个技能分类。 数据集用于2019年的《内科前景》挑战,为手术工作流程和技能分析的分置评估。 12个小组提交了用于20 % 和23 % 的内勤分析,为23 % 。