Previous soft tissue manipulation studies assumed that the grasping point was known and the target deformation can be achieved. During the operation, the constraints are supposed to be constant, and there is no obstacles around the soft tissue. To go beyond these assumptions, a deep reinforcement learning framework with prior knowledge is proposed for soft tissue manipulation under unknown constraints, such as the force applied by fascia. The prior knowledge is represented through an intuitive manipulation strategy. As an action of the agent, a regulator factor is used to coordinate the intuitive approach and the deliberate network. A reward function is designed to balance the exploration and exploitation for large deformation. Successful simulation results verify that the proposed framework can manipulate the soft tissue while avoiding obstacles and adding new position constraints. Compared with the soft actor-critic (SAC) algorithm, the proposed framework can accelerate the training procedure and improve the generalization.
翻译:先前的软组织操纵研究假定,掌握点已经为人所知,目标变形可以实现。 在操作过程中,限制应该是固定的,软组织周围没有障碍。除了这些假设之外,还提出一个事先知情的深强化学习框架,用于软组织操纵,其限制不明,如Fascia的武力。先前的知识通过直觉操纵战略来体现。作为代理人的行动,一个调节因素被用来协调直觉方法和蓄意网络。一个奖励功能旨在平衡对大规模变形的探索和开发。成功的模拟结果可以证实拟议框架可以操纵软组织,同时避免障碍和增加新的位置限制。与软性行为者-批评算法相比,拟议框架可以加快培训程序,改进一般化。