Nowadays, there are more surgical procedures that are being performed using minimally invasive surgery (MIS). This is due to its many benefits, such as minimal post-operative problems, less bleeding, minor scarring, and a speedy recovery. However, the MIS's constrained field of view, small operating room, and indirect viewing of the operating scene could lead to surgical tools colliding and potentially harming human organs or tissues. Therefore, MIS problems can be considerably reduced, and surgical procedure accuracy and success rates can be increased by using an endoscopic video feed to detect and monitor surgical instruments in real-time. In this paper, a set of improvements made to the YOLOV5 object detector to enhance the detection of surgical instruments was investigated, analyzed, and evaluated. In doing this, we performed performance-based ablation studies, explored the impact of altering the YOLOv5 model's backbone, neck, and anchor structural elements, and annotated a unique endoscope dataset. Additionally, we compared the effectiveness of our ablation investigations with that of four additional SOTA object detectors (YOLOv7, YOLOR, Scaled-YOLOv4 and YOLOv3-SPP). Except for YOLOv3-SPP, which had the same model performance of 98.3% in mAP and a similar inference speed, all of our benchmark models, including the original YOLOv5, were surpassed by our top refined model in experiments using our fresh endoscope dataset.
翻译:目前,使用侵入性最小的外科手术(MIS)的外科手术程序越来越多,而且手术程序的准确性和成功率可以大大提高,这是因为它有许多好处,例如手术后问题极小,出血较少,伤疤小,以及迅速恢复。然而,对MIS的有限视野、小手术室和间接观察操作现场,都可能导致外科手术工具碰撞和可能伤害人体器官或组织。因此,管理信息系统问题可以大大减少,通过使用内窥镜视频接口实时检测和监测外科仪器可以提高外科手术程序的准确性和成功率。在本文中,对YOLOV5物体探测器进行了一系列改进,以加强外科仪器探测。在做这项工作时,我们进行了基于性能的反动研究,探讨了改变YOLOv5模型的骨干、颈部和锚固定结构元的影响,并附加了一个独特的内镜模型。此外,我们所有的直线调查与另外四台SOTA物体探测器(YOLOV7、YOL3Aslobal-OSOL)的原始性能数据(包括YPPO3OSl-OSlal)的原变速度数据。