Object detection at night is a challenging problem due to the absence of night image annotations. Despite several domain adaptation methods, achieving high-precision results remains an issue. False-positive error propagation is still observed in methods using the well-established student-teacher framework, particularly for small-scale and low-light objects. This paper proposes a two-phase consistency unsupervised domain adaptation network, 2PCNet, to address these issues. The network employs high-confidence bounding-box predictions from the teacher in the first phase and appends them to the student's region proposals for the teacher to re-evaluate in the second phase, resulting in a combination of high and low confidence pseudo-labels. The night images and pseudo-labels are scaled-down before being used as input to the student, providing stronger small-scale pseudo-labels. To address errors that arise from low-light regions and other night-related attributes in images, we propose a night-specific augmentation pipeline called NightAug. This pipeline involves applying random augmentations, such as glare, blur, and noise, to daytime images. Experiments on publicly available datasets demonstrate that our method achieves superior results to state-of-the-art methods by 20\%, and to supervised models trained directly on the target data.
翻译:----
物体检测在夜间是一个具有挑战性的问题,因为缺乏夜间图像的注释。尽管存在几种领域自适应的方法,但仍然存在高精度结果的问题。使用已建立的师生框架,特别是对于小规模和低光对象,仍然观察到误报错误的传播。本文提出了一个分为两个阶段的一致性无监督领域自适应网络2PCNet,以解决这些问题。网络在第一阶段使用来自教师的高置信度边界框预测,并将它们附加到教师重新评估的存储区域提议中,从而产生高置信度伪标签和低置信度伪标签的组合。夜间图像和伪标签在用作学生输入之前被缩小,从而提供更强的小规模伪标签。为了解决由于低光区域和其他夜间属性在图像中引起的错误,我们提出了一个称为NightAug的针对夜间的特定增强管道。该管道涉及将随机增强,如耀斑、模糊和噪声应用于日间图像。在公开可用的数据集上进行的实验表明,我们的方法比最先进的方法和直接在目标数据上训练的监督模型取得了优异的结果,精度提高了20\%。