Cardiovascular diseases remain the leading cause of global mortality, with minimally invasive treatment options offered through endovascular interventions. However, the precision and adaptability of current robotic systems for endovascular navigation are limited by heuristic control, low autonomy, and the absence of haptic feedback. This thesis presents an integrated AI-driven framework for autonomous guidewire navigation in complex vascular environments, addressing key challenges in data availability, simulation fidelity, and navigational accuracy. A high-fidelity, real-time simulation platform, CathSim, is introduced for reinforcement learning based catheter navigation, featuring anatomically accurate vascular models and contact dynamics. Building on CathSim, the Expert Navigation Network is developed, a policy that fuses visual, kinematic, and force feedback for autonomous tool control. To mitigate data scarcity, the open-source, bi-planar fluoroscopic dataset Guide3D is proposed, comprising more than 8,700 annotated images for 3D guidewire reconstruction. Finally, SplineFormer, a transformer-based model, is introduced to directly predict guidewire geometry as continuous B-spline parameters, enabling interpretable, real-time navigation. The findings show that combining high-fidelity simulation, multimodal sensory fusion, and geometric modelling substantially improves autonomous endovascular navigation and supports safer, more precise minimally invasive procedures.
翻译:心血管疾病仍是全球首要致死病因,血管内介入手术提供了微创治疗方案。然而,当前用于血管内导航的机器人系统在精确性与适应性方面受限于启发式控制、低自主性及触觉反馈的缺失。本论文提出一种集成化人工智能驱动框架,用于复杂血管环境中的导丝自主导航,解决了数据可用性、仿真保真度与导航精度等关键挑战。研究引入了高保真实时仿真平台CathSim,用于基于强化学习的导管导航,该平台具备解剖学精确的血管模型与接触动力学模拟。基于CathSim,我们开发了专家导航网络——一种融合视觉、运动学与力反馈以实现器械自主控制的策略。为缓解数据稀缺问题,提出了开源双平面透视影像数据集Guide3D,包含超过8,700张标注图像用于三维导丝重建。最后,引入了基于Transformer的模型SplineFormer,可直接将导丝几何形态预测为连续B样条参数,实现可解释的实时导航。研究结果表明,结合高保真仿真、多模态传感融合与几何建模能显著提升血管内自主导航性能,并为更安全、更精确的微创手术提供支持。