Occlusion from obstacles, such as foliage, can severely obstruct a robot's sensors, impairing scene understanding. We show that "peering", a characteristic side-to-side movement used by insects to overcome their visual limitations, can also allow robots to markedly improve visual reasoning under partial occlusion. This is accomplished by applying core signal processing principles, specifically optical synthetic aperture sensing, together with the vision reasoning capabilities of modern large multimodal models. Peering enables real-time, high-resolution, and wavelength-independent perception, which is crucial for vision-based scene understanding across a wide range of applications. The approach is low-cost and immediately deployable on any camera-equipped robot. We investigated different peering motions and occlusion masking strategies, demonstrating that, unlike peering, state-of-the-art multi-view 3D vision techniques fail in these conditions due to their high susceptibility to occlusion. Robots that see through occlusion will gain superior perception abilities - including enhanced scene understanding, situational awareness, camouflage breaking, and advanced navigation.
翻译:由树叶等障碍物造成的遮挡会严重阻碍机器人的传感器,损害场景理解能力。我们证明,"窥视"这种昆虫用于克服视觉局限性的特征性左右移动方式,同样能使机器人在部分遮挡条件下显著提升视觉推理能力。该方法通过应用核心信号处理原理(特别是光学合成孔径传感技术),结合现代大型多模态模型的视觉推理能力实现。窥视运动支持实时、高分辨率且与波长无关的感知,这对于广泛应用的视觉场景理解至关重要。该方案成本低廉,可立即部署于任何配备摄像头的机器人。我们研究了不同的窥视运动模式与遮挡掩蔽策略,结果表明:与窥视方法不同,最先进的多视角三维视觉技术因对遮挡高度敏感而在此类条件下失效。能够穿透遮挡的机器人将获得卓越的感知能力——包括增强的场景理解、态势感知、伪装破除及高级导航功能。