Unsupervised domain adaptive object detection aims to adapt a well-trained detector from its original source domain with rich labeled data to a new target domain with unlabeled data. Previous works focus on improving the domain adaptability of region-based detectors, e.g., Faster-RCNN, through matching cross-domain instance-level features that are explicitly extracted from a region proposal network (RPN). However, this is unsuitable for region-free detectors such as single shot detector (SSD), which perform a dense prediction from all possible locations in an image and do not have the RPN to encode such instance-level features. As a result, they fail to align important image regions and crucial instance-level features between the domains of region-free detectors. In this work, we propose an adversarial module to strengthen the cross-domain matching of instance-level features for region-free detectors. Firstly, to emphasize the important regions of image, the DSEM learns to predict a transferable foreground enhancement mask that can be utilized to suppress the background disturbance in an image. Secondly, considering that region-free detectors recognize objects of different scales using multi-scale feature maps, the DSEM encodes both multi-level semantic representations and multi-instance spatial-contextual relationships across different domains. Finally, the DSEM is pluggable into different region-free detectors, ultimately achieving the densely semantic feature matching via adversarial learning. Extensive experiments have been conducted on PASCAL VOC, Clipart, Comic, Watercolor, and FoggyCityscape benchmarks, and their results well demonstrate that the proposed approach not only improves the domain adaptability of region-free detectors but also outperforms existing domain adaptive region-based detectors under various domain shift settings.
翻译:未经监督的域域适应对象检测旨在将原始源域受过良好训练的探测器从原始源域调适为具有丰富的标签数据、带有未贴标签数据的新目标域。 先前的工作重点是通过匹配从区域建议网络(RPN)明确提取的跨度实例级特征,改进基于区域检测器(例如Apper-RCNN)的域适应性。 然而,这不适合无区域检测器,例如单射探测器(SSD),它从图像中的所有可能地点进行密集预测,而没有 RPN 来编码此类实例级特征。 结果,它们未能在无区域探测器的域间对重要图像区和关键实例级功能进行匹配。 在这项工作中,我们提议了一个对抗模块,以加强无区域检测器级别检测器的跨度匹配。 首先,为了强调重要的图像区域, DSEM 学会预测可转移的地面增强值掩罩,只能用来抑制图像中的背景级级变异度,但又考虑到区域无区域级探测器通过多度的域域域域内变变, 的多域域域域内变变变, 的域域域域域域域域域域域域域域域域内变变, 显示为最后的域域域域域域域域域域域域域域域域域域域域域域域域域域域域域域域域域域域域域内的域内的域域间变的域间变变的域间变的域域域域间变。