We present SurgeonAssist-Net: a lightweight framework that makes action-and-workflow-driven virtual assistance for a set of predefined surgical tasks accessible to commercially available optical see-through head-mounted displays (OST-HMDs). On a widely used benchmark dataset for laparoscopic surgical workflow, our implementation competes with state-of-the-art approaches in prediction accuracy for automated task recognition, yet requires 7.4x fewer parameters, uses 10.2x fewer floating-point operations (FLOPs), is 7.0x faster for inference on a CPU, and is capable of near real-time performance on the Microsoft HoloLens 2 OST-HMD. To achieve this, we use an efficient convolutional neural network (CNN) backbone to extract discriminative features from image data, and a low-parameter recurrent neural network (RNN) architecture to learn long-term temporal dependencies. To demonstrate the feasibility of our approach for inference on the HoloLens 2, we created a sample dataset that included video of several surgical tasks recorded from a user-centric point of view. After training, we deployed our model and cataloged its performance in an online simulated surgical scenario for the prediction of the current surgical task. The utility of our approach is explored in the discussion of several relevant clinical use cases. Our code is publicly available at https://github.com/doughtmw/surgeon-assist-net.
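To make the CNN-backbone-plus-RNN pattern described above concrete, the following is a minimal PyTorch sketch. The specific choices here (torchvision's mobilenet_v3_small as a stand-in lightweight backbone, a single-layer GRU, the hidden size, and the class name) are illustrative assumptions, not the published model; the actual architecture is in the linked repository.

```python
# A minimal sketch of a lightweight CNN backbone + low-parameter RNN for
# surgical task recognition from video clips. Hypothetical stand-ins:
# mobilenet_v3_small backbone and a single-layer GRU (not the published model).
import torch
import torch.nn as nn
from torchvision.models import mobilenet_v3_small

class SurgicalTaskRecognizer(nn.Module):
    def __init__(self, num_tasks: int, hidden_size: int = 128):
        super().__init__()
        backbone = mobilenet_v3_small(weights=None)
        # Keep the convolutional feature extractor; drop the classifier head.
        self.features = backbone.features
        self.pool = nn.AdaptiveAvgPool2d(1)
        feat_dim = 576  # channels of mobilenet_v3_small's final conv stage
        # Low-parameter recurrent head to learn long-term temporal dependencies.
        self.rnn = nn.GRU(feat_dim, hidden_size, batch_first=True)
        self.head = nn.Linear(hidden_size, num_tasks)

    def forward(self, clip: torch.Tensor) -> torch.Tensor:
        # clip: (batch, time, 3, H, W) sequence of video frames
        b, t, c, h, w = clip.shape
        x = self.features(clip.view(b * t, c, h, w))
        x = self.pool(x).flatten(1).view(b, t, -1)  # (batch, time, feat_dim)
        out, _ = self.rnn(x)
        return self.head(out[:, -1])  # classify the current surgical task

model = SurgicalTaskRecognizer(num_tasks=7)  # e.g., 7 phases in a workflow benchmark
logits = model(torch.randn(2, 8, 3, 224, 224))  # two 8-frame clips
print(logits.shape)  # torch.Size([2, 7])
```

Per-frame features are pooled and fed as a sequence to the GRU, so the frame encoder and temporal model stay small enough for CPU or on-device inference, which is the design constraint the abstract emphasizes.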