Surgical skills have a great influence on surgical safety and patients' well-being. Traditional assessment of surgical skills involves strenuous manual efforts, which lacks efficiency and repeatability. Therefore, we attempt to automatically predict how well the surgery is performed using the surgical video. In this paper, a unified multi-path framework for automatic surgical skill assessment is proposed, which takes care of multiple composing aspects of surgical skills, including surgical tool usage, intraoperative event pattern, and other skill proxies. The dependency relationships among these different aspects are specially modeled by a path dependency module in the framework. We conduct extensive experiments on the JIGSAWS dataset of simulated surgical tasks, and a new clinical dataset of real laparoscopic surgeries. The proposed framework achieves promising results on both datasets, with the state-of-the-art on the simulated dataset advanced from 0.71 Spearman's correlation to 0.80. It is also shown that combining multiple skill aspects yields better performance than relying on a single aspect.
翻译:外科手术技能对外科安全和病人的安康有很大影响。传统的外科手术技能评估涉及艰苦的人工工作,缺乏效率和重复性。因此,我们试图自动预测手术使用外科视频的精度。在本文中,提议了一个统一的多途径框架,用于自动外科手术技能评估,这个框架将照顾外科手术技能的多重组成方面,包括外科工具的使用、内科事件模式和其他技能代理。这些不同方面的依赖关系由框架中的路径依赖模块特别建模。我们对模拟外科手术任务的JIGSAWS数据集和真实腹腔外科手术手术的新临床数据集进行了广泛的实验。拟议框架在这两个数据集上都取得了大有希望的结果,模拟数据集从0.71 Spearman的关联到0.80。还显示,多重技能方面的结合比依赖单一方面更能产生更好的效果。