Challenges in traversing dynamic clutters lie mainly in the efficient perception of the environmental dynamics and the generation of evasive behaviors considering obstacle movement. Previous solutions have made progress in explicitly modeling the dynamic obstacle motion for avoidance, but this key dependency of decision-making is time-consuming and unreliable in highly dynamic scenarios with occlusions. On the contrary, without introducing object detection, tracking, and prediction, we empower the reinforcement learning (RL) with single LiDAR sensing to realize an autonomous flight system directly from point to motion. For exteroception, a depth sensing distance map achieving fixed-shape, low-resolution, and detail-safe is encoded from raw point clouds, and an environment change sensing point flow is adopted as motion features extracted from multi-frame observations. These two are integrated into a lightweight and easy-to-learn representation of complex dynamic environments. For action generation, the behavior of avoiding dynamic threats in advance is implicitly driven by the proposed change-aware sensing representation, where the policy optimization is indicated by the relative motion modulated distance field. With the deployment-friendly sensing simulation and dynamics model-free acceleration control, the proposed system shows a superior success rate and adaptability to alternatives, and the policy derived from the simulator can drive a real-world quadrotor with safe maneuvers.
翻译:穿越动态障碍物的挑战主要在于高效感知环境动态性以及考虑障碍物运动生成规避行为。现有方法在显式建模动态障碍物运动以实现避障方面取得进展,但这一决策关键依赖在存在遮挡的高度动态场景中耗时且不可靠。相反,我们无需引入目标检测、跟踪与预测,仅通过单激光雷达感知赋能强化学习(RL),实现从点到运动的自主飞行系统。在外感知方面,从原始点云编码出具备固定形状、低分辨率且保留细节的深度感知距离图,并采用从多帧观测中提取的运动特征——环境变化感知点流。二者融合为轻量化且易于学习的复杂动态环境表征。在动作生成方面,通过所提出的变化感知表征隐式驱动提前规避动态威胁的行为,其中策略优化由相对运动调制的距离场指示。借助部署友好的感知仿真与无动力学模型加速控制,所提系统展现出优于替代方案的通过率与适应性,且从仿真器推导的策略可驱动真实四旋翼飞行器实现安全机动。