在自主机器人辅助外科手术中使用安全强化学习系统,对自动机器人辅助外科手术中的组织裁减进行正式核查 (Safe Reinforcement Learning using Formal Verification for Tissue Retraction in Autonomous Robotic-Assisted Surgery)

Deep Reinforcement Learning (DRL) is a viable solution for automating repetitive surgical subtasks due to its ability to learn complex behaviours in a dynamic environment. This task automation could lead to reduced surgeon's cognitive workload, increased precision in critical aspects of the surgery, and fewer patient-related complications. However, current DRL methods do not guarantee any safety criteria as they maximise cumulative rewards without considering the risks associated with the actions performed. Due to this limitation, the application of DRL in the safety-critical paradigm of robot-assisted Minimally Invasive Surgery (MIS) has been constrained. In this work, we introduce a Safe-DRL framework that incorporates safety constraints for the automation of surgical subtasks via DRL training. We validate our approach in a virtual scene that replicates a tissue retraction task commonly occurring in multiple phases of an MIS. Furthermore, to evaluate the safe behaviour of the robotic arms, we formulate a formal verification tool for DRL methods that provides the probability of unsafe configurations. Our results indicate that a formal analysis guarantees safety with high confidence such that the robotic instruments operate within the safe workspace and avoid hazardous interaction with other anatomical structures.

翻译：深入强化学习(DRL)是使重复性外科子任务自动化的一个可行解决方案,因为它有能力在动态环境中学习复杂的行为。任务自动化可以减少外科医生的认知工作量,提高外科手术关键方面的精确度,减少与病人有关的并发症。然而,目前的DRL方法并不能保证任何安全标准,因为如果不考虑所采取行动的风险,就会获得最大的累积回报。由于这一局限性,在机器人辅助小型侵入性外科手术(MIS)的安全关键范式中应用DRL受到制约。在这项工作中,我们引入了一个安全DRL框架,通过DRL培训将外科子任务自动化的安全限制纳入其中。我们验证了我们在虚拟环境中的做法,在虚拟环境中复制了通常发生在多阶段IMIS中的组织回收任务。此外,为了评估机器人武器的安全行为,我们为DRL方法制定了一个正式的核查工具,提供不安全配置的可能性。我们的结果表明,正式分析保证了安全性,因为机器人仪器在安全工作空间内操作,避免与其他原子结构发生危险互动。

相关内容

Automator

关注 5

Automator是苹果公司为他们的Mac OS X系统开发的一款软件。 只要通过点击拖拽鼠标等操作就可以将一系列动作组合成一个工作流，从而帮助你自动的（可重复的）完成一些复杂的工作。Automator还能横跨很多不同种类的程序，包括：查找器、Safari网络浏览器、iCal、地址簿或者其他的一些程序。它还能和一些第三方的程序一起工作，如微软的Office、Adobe公司的Photoshop或者Pixelmator等。

【干货书】机器人元素Elements of Robotics ，311页pdf

专知会员服务

38+阅读 · 2021年4月16日

【IJCAI2020】TransOMCS: 从语言图谱到常识图谱

专知会员服务

35+阅读 · 2020年5月4日

深度强化学习策略梯度教程，53页ppt

专知会员服务

184+阅读 · 2020年2月1日

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日