Weakly supervised object localization (WSOL) focuses on localizing objects only with the supervision of image-level classification masks. Most previous WSOL methods follow the classification activation map (CAM) that localizes objects based on the classification structure with the multi-instance learning (MIL) mechanism. However, the MIL mechanism makes CAM only activate discriminative object parts rather than the whole object, weakening its performance for localizing objects. To avoid this problem, this work provides a novel perspective that models WSOL as a domain adaption (DA) task, where the score estimator trained on the source/image domain is tested on the target/pixel domain to locate objects. Under this perspective, a DA-WSOL pipeline is designed to better engage DA approaches into WSOL to enhance localization performance. It utilizes a proposed target sampling strategy to select different types of target samples. Based on these types of target samples, domain adaption localization (DAL) loss is elaborated. It aligns the feature distribution between the two domains by DA and makes the estimator perceive target domain cues by Universum regularization. Experiments show that our pipeline outperforms SOTA methods on multi benchmarks. Code are released at \url{https://github.com/zh460045050/DA-WSOL_CVPR2022}.
翻译:微弱监督对象本地化(WSOL)仅以图像级分类面具的监管为主,将目标定位为本地化对象; 过去的大多数WSOL方法都遵循分类激活图(CAM),该图将基于分类结构的物体与多因子学习(MIL)机制本地化为本地化;然而,MIL机制使CAM只激活歧视对象部件,而不是整个对象,从而削弱其定位对象的性能;为了避免这一问题,这项工作提供了一个新颖的视角,即WSSOL模型作为域适应(DA)任务,由源/图像域培训的评分天分天分在目标/像素域进行测试,以定位对象。在这个视角下,DA-SOL管道设计了一个更好地将DA方法与SWOL连接到本地化绩效;利用拟议的目标取样战略选择了不同类型的目标样品;根据这些类型,详细拟订了域调整本地本地化(DAL)20损失。它与DADA的两个域的地貌分布相匹配,并使估测官在目标域域域域域域内看到目标域域域标标指向目标域域域域域,由UVVSOVS-RCVSO/DA50/DA格式规范化)。