Purpose: We propose a formal framework for the modeling and segmentation of minimally-invasive surgical tasks using a unified set of motion primitives (MPs) to enable more objective labeling and the aggregation of different datasets. Methods: We model dry-lab surgical tasks as finite state machines, representing how the execution of MPs as the basic surgical actions results in the change of surgical context, which characterizes the physical interactions among tools and objects in the surgical environment. We develop methods for labeling surgical context based on video data and for automatic translation of context to MP labels. We then use our framework to create the COntext and Motion Primitive Aggregate Surgical Set (COMPASS), including six dry-lab surgical tasks from three publicly-available datasets (JIGSAWS, DESK, and ROSMA), with kinematic and video data and context and MP labels. Results: Our context labeling method achieves near-perfect agreement between consensus labels from crowd-sourcing and expert surgeons. Segmentation of tasks to MPs results in the creation of the COMPASS dataset that nearly triples the amount of data for modeling and analysis and enables the generation of separate transcripts for the left and right tools. Conclusion: The proposed framework results in high quality labeling of surgical data based on context and fine-grained MPs. Modeling surgical tasks with MPs enables the aggregation of different datasets and the separate analysis of left and right hands for bimanual coordination assessment. Our formal framework and aggregate dataset can support the development of explainable and multi-granularity models for improved surgical process analysis, skill assessment, error detection, and autonomy.
翻译:摘要:本文提出了一个形式化框架,用于使用统一的运动原语(MPs)对微创手术任务进行建模和分割,以实现更客观的标注和不同数据集的聚合。我们将干湿实验室手术任务建模为有限状态机,表示MP执行作为基本手术动作如何导致手术环境中工具和物体之间物理交互的变化。我们开发了基于视频数据进行手术环境标注和自动将环境转化为MP标签的方法。然后,我们使用我们的框架创建了COntext和Motion Primitive Aggregate Surgical Set(COMPASS)总体数据集,其中包括三个公开数据集(JIGSAWS、DESK、ROSMA)中的六个干湿实验室手术任务,以及运动学和视频数据、环境和MP标签。我们的环境标注方法实现了众包和专家外科医生间共识标签近乎完美的一致性。任务分割为MP,结果是创建了COMPASS数据集,该数据集将建模和分析的数据量增加了近三倍,并且可以为左右手工具生成单独的记录。所提出的框架通过上下文和细粒度MP对手术数据进行高质量的标注。使用MP对手术任务进行建模,可以聚合不同的数据集,并对双手协调评估进行分析。我们的正式框架和总体数据集可以支持发展可解释和多层级模型,以改进手术过程分析、技能评估、错误检测和自治。