Recently, horizontal representation-based panoramic semantic segmentation approaches outperform projection-based solutions, because the distortions can be effectively removed by compressing the spherical data in the vertical direction. However, these methods ignore the distortion distribution prior and are limited to unbalanced receptive fields, e.g., the receptive fields are sufficient in the vertical direction and insufficient in the horizontal direction. Differently, a vertical representation compressed in another direction can offer implicit distortion prior and enlarge horizontal receptive fields. In this paper, we combine the two different representations and propose a novel 360{\deg} semantic segmentation solution from a complementary perspective. Our network comprises three modules: a feature extraction module, a bi-directional compression module, and an ensemble decoding module. First, we extract multi-scale features from a panorama. Then, a bi-directional compression module is designed to compress features into two complementary low-dimensional representations, which provide content perception and distortion prior. Furthermore, to facilitate the fusion of bi-directional features, we design a unique self distillation strategy in the ensemble decoding module to enhance the interaction of different features and further improve the performance. Experimental results show that our approach outperforms the state-of-the-art solutions with at least 10\% improvement on quantitative evaluations while displaying the best performance on visual appearance.
翻译:最近,基于横向代表的全方位语义分解方法超越了基于预测的解决方案,因为扭曲可以通过压缩垂直方向的球形数据来有效消除。然而,这些方法忽略了先前的扭曲分布,并局限于不平衡的可接受字段,例如,接受字段在垂直方向上是足够的,在横向方向上是不足的。不同的是,在另一个方向上压缩的垂直表达方式可以提供隐性扭曲,在前一个方向上是扩大的横向可接受字段。在本文中,我们结合了两个不同的表达方式,从互补的角度提出了一个新的360\deg}语义分解解决方案。我们的网络由三个模块组成:一个功能提取模块,一个双向压缩模块,这些模块将忽略之前的扭曲分布,并且局限于三个模块:一个双向压缩模块,这些模块可以将不同功能压缩成两个互为补充的低维表达方式,提供内容感知觉和扭曲。此外,为了便利双向特征的融合,我们网络由三个模块组成:一个功能提取模块,一个双向压缩模块压缩模块,一个双向压缩模块,在最小的图像压缩模块解码解析模块中设计一个独特的自我精化战略,同时改进10度模块将改进我们最差的演示式的演示式格式的演示式模式的演示式模式,以显示10度模式的演示式格式的演示式的演示式模式,以强化了10的演示式格式的演示式格式的演示式模式的演示式的演示。