Image-level weakly supervised semantic segmentation (WSSS) is a fundamental yet challenging computer vision task facilitating scene understanding and automatic driving. Most existing methods resort to classification-based Class Activation Maps (CAMs) to play as the initial pseudo labels, which tend to focus on the discriminative image regions and lack customized characteristics for the segmentation task. To alleviate this issue, we propose a novel activation modulation and recalibration (AMR) scheme, which leverages a spotlight branch and a compensation branch to obtain weighted CAMs that can provide recalibration supervision and task-specific concepts. Specifically, an attention modulation module (AMM) is employed to rearrange the distribution of feature importance from the channel-spatial sequential perspective, which helps to explicitly model channel-wise interdependencies and spatial encodings to adaptively modulate segmentation-oriented activation responses. Furthermore, we introduce a cross pseudo supervision for dual branches, which can be regarded as a semantic similar regularization to mutually refine two branches. Extensive experiments show that AMR establishes a new state-of-the-art performance on the PASCAL VOC 2012 dataset, surpassing not only current methods trained with the image-level of supervision but also some methods relying on stronger supervision, such as saliency label. Experiments also reveal that our scheme is plug-and-play and can be incorporated with other approaches to boost their performance.
翻译:为缓解这一问题,我们建议采用新型的激活调节和校正(AMR)办法,利用一个焦点分支和一个补偿分支获得一个加权的CAM,以提供对称监督和任务特定概念。具体地说,使用一个关注调节模块(AMM)来重新安排频道空间连续定位角度的地物重要性分布,该模块有助于为适应性调节分解的启动响应建立明确的示范渠道性相互依存关系和空间编码。此外,我们为两分支引入一个交叉的假监管机制,这可以被视为一种与相互完善的两个分支相似的语义规范。 广泛的实验显示,AMR建立了一个新的州级的升级监管模式,而不是升级的升级的演示模式。