Real-time semantic segmentation has received considerable attention due to growing demands in many practical applications, such as autonomous vehicles, robotics, etc. Existing real-time segmentation approaches often utilize feature fusion to improve segmentation accuracy. However, they fail to fully consider the feature information at different resolutions and the receptive fields of the networks are relatively limited, thereby compromising the performance. To tackle this problem, we propose a light Cascaded Selective Resolution Network (CSRNet) to improve the performance of real-time segmentation through multiple context information embedding and enhanced feature aggregation. The proposed network builds a three-stage segmentation system, which integrates feature information from low resolution to high resolution and achieves feature refinement progressively. CSRNet contains two critical modules: the Shorted Pyramid Fusion Module (SPFM) and the Selective Resolution Module (SRM). The SPFM is a computationally efficient module to incorporate the global context information and significantly enlarge the receptive field at each stage. The SRM is designed to fuse multi-resolution feature maps with various receptive fields, which assigns soft channel attentions across the feature maps and helps to remedy the problem caused by multi-scale objects. Comprehensive experiments on two well-known datasets demonstrate that the proposed CSRNet effectively improves the performance for real-time segmentation.
翻译:由于许多实际应用(如自主车辆、机器人等)的需求不断增加,实时静音分解受到相当重视。现有的实时分解方法经常利用特征聚合来提高分解准确性,但是它们没有充分考虑到不同分辨率的特征信息,而网络的可接收领域相对有限,从而损害性能。为了解决这一问题,我们提议建立一个光连锁选择性解析网络(CSRNet),以便通过多种背景信息嵌入和增强功能集成来改进实时分解的性能。拟议的网络建立一个三阶段分解系统,将低分辨率的特征信息整合到高分辨率,并逐步实现特征精细化。CSRNet包含两个关键模块:短膜分解模块(SPFM)和选择性解析模块(SRM),这是一个计算高效的模块,以纳入全球背景信息,并大幅扩大每个阶段的可接收字段。SRM旨在将多分辨率特征地图与各种可接收域相结合,在地图中分配软通道关注,并逐步实现特征精细化。CSR网络包含两个关键模块,通过多尺度的功能模型,有效地展示所认识的实时实时改进的实时数据。