Existing Active SLAM methodologies face issues such as slow exploration speed and suboptimal paths. To address these limitations, we propose a hybrid framework combining a Path-Uncertainty Co-Optimization Deep Reinforcement Learning framework and a Lightweight Stagnation Detection mechanism. The Path-Uncertainty Co-Optimization framework jointly optimizes travel distance and map uncertainty through a dual-objective reward function, balancing exploration and exploitation. The Lightweight Stagnation Detection reduces redundant exploration through Lidar Static Anomaly Detection and Map Update Stagnation Detection, terminating episodes on low expansion rates. Experimental results show that compared with the frontier-based method and RRT method, our approach shortens exploration time by up to 65% and reduces path distance by up to 42%, significantly improving exploration efficiency in complex environments while maintaining reliable map completeness. Ablation studies confirm that the collaborative mechanism accelerates training convergence. Empirical validation on a physical robotic platform demonstrates the algorithm's practical applicability and its successful transferability from simulation to real-world environments.
翻译:现有的主动SLAM方法面临探索速度慢和路径次优等问题。为应对这些局限,本文提出一种混合框架,融合了路径-不确定性协同优化的深度强化学习框架与轻量级停滞检测机制。路径-不确定性协同优化框架通过双目标奖励函数联合优化行进距离与地图不确定性,平衡探索与利用。轻量级停滞检测通过激光雷达静态异常检测与地图更新停滞检测减少冗余探索,在扩展率较低时终止探索回合。实验结果表明,相较于基于前沿的方法和RRT方法,本方法将探索时间缩短最高达65%,路径距离减少最高达42%,在复杂环境中显著提升探索效率,同时保持可靠的地图完整性。消融研究证实协同机制加速了训练收敛。在物理机器人平台上的实证验证展示了算法的实际适用性及其从仿真到现实环境的成功迁移能力。