Objective: Surgical activity recognition is a fundamental step in computer-assisted interventions. This paper reviews the state-of-the-art in methods for automatic recognition of fine-grained gestures in robotic surgery focusing on recent data-driven approaches and outlines the open questions and future research directions. Methods: An article search was performed on 5 bibliographic databases with the following search terms: robotic, robot-assisted, JIGSAWS, surgery, surgical, gesture, fine-grained, surgeme, action, trajectory, segmentation, recognition, parsing. Selected articles were classified based on the level of supervision required for training and divided into different groups representing major frameworks for time series analysis and data modelling. Results: A total of 52 articles were reviewed. The research field is showing rapid expansion, with the majority of articles published in the last 4 years. Deep-learning-based temporal models with discriminative feature extraction and multi-modal data integration have demonstrated promising results on small surgical datasets. Currently, unsupervised methods perform significantly less well than the supervised approaches. Conclusion: The development of large and diverse open-source datasets of annotated demonstrations is essential for development and validation of robust solutions for surgical gesture recognition. While new strategies for discriminative feature extraction and knowledge transfer, or unsupervised and semi-supervised approaches, can mitigate the need for data and labels, they have not yet been demonstrated to achieve comparable performance. Important future research directions include detection and forecast of gesture-specific errors and anomalies. Significance: This paper is a comprehensive and structured analysis of surgical gesture recognition methods aiming to summarize the status of this rapidly evolving field.
翻译:目标:对5个书目数据库进行文章搜索,其搜索条件如下:机器人、机器人辅助、JIGSAWS、外科、外科、手势、细刮、激增、动作、轨迹、分解、识别、解析。部分文章根据培训所需的监督水平进行分类,分为代表时间序列分析和数据建模主要框架的不同组别。结果:共审查了52篇文章。研究领域显示迅速扩展,大多数文章在过去四年中发表。基于深学习的时间模型,带有歧视性特征提取和多模式数据整合,显示了小手术数据集的可喜结果。目前,未完成的方法比监督的方法表现得要差得多。这些方法根据培训所需的监督水平进行分类,分为代表时间序列分析和数据建模方向的主要框架的不同组别。这个结果:总共审查了52篇文章。研究领域显示迅速扩展,大多数文章在过去四年中发表。基于深层次特征提取、外科外科外科数据整合的深层次定位模型显示,这种实地大规模和多样化的开源数据设置,对于快速解读的外科诊断和精确度分析,对于快速解读的诊断方法来说是必要的。