Formalizing surgical activities as triplets of the used instruments, actions performed, and target anatomies is becoming a gold standard approach for surgical activity modeling. The benefit is that this formalization helps to obtain a more detailed understanding of tool-tissue interaction which can be used to develop better Artificial Intelligence assistance for image-guided surgery. Earlier efforts and the CholecTriplet challenge introduced in 2021 have put together techniques aimed at recognizing these triplets from surgical footage. Estimating also the spatial locations of the triplets would offer a more precise intraoperative context-aware decision support for computer-assisted intervention. This paper presents the CholecTriplet2022 challenge, which extends surgical action triplet modeling from recognition to detection. It includes weakly-supervised bounding box localization of every visible surgical instrument (or tool), as the key actors, and the modeling of each tool-activity in the form of <instrument, verb, target> triplet. The paper describes a baseline method and 10 new deep learning algorithms presented at the challenge to solve the task. It also provides thorough methodological comparisons of the methods, an in-depth analysis of the obtained results, their significance, and useful insights for future research directions and applications in surgery.
翻译:将外科手术活动正规化为用过的仪器、行动和目标解剖的三胞胎的三胞胎,正在成为外科手术活动模型的金标准标准方法。这种正规化的好处是,这种正规化有助于更详细地了解工具-问题互动,可以用来为图像制导手术提供更好的人工智能协助。早期的努力和2021年推出的骨盆三角挑战汇集了各种技术,旨在从外科片中识别这些三胞胎。还估算了三胞胎的空间位置,将为计算机辅助干预提供更精确的内科环境认知决策支持。本文介绍了CholecTriplet 2022挑战,它把外科手术三胞胎模型从识别扩展到检测。它包括每个显眼外科手术工具(或工具)作为关键行为者的薄弱、超强的捆绑框本地化,以及以 < 仪表、 动词、 目标 > 三胞三胞体的形式对每项工具活动进行模型化。本文描述了一种基线方法和10种新的深层次算法,用来解决任务的难题。它提供了深入的深度分析、方法分析方法上的重要方向。