Domain Adaptive Object Detection (DAOD) focuses on improving the generalization ability of object detectors via knowledge transfer. Recent advances in DAOD strive to change the emphasis of the adaptation process from global to local in virtue of fine-grained feature alignment methods. However, both the global and local alignment approaches fail to capture the topological relations among different foreground objects as the explicit dependencies and interactions between and within domains are neglected. In this case, only seeking one-vs-one alignment does not necessarily ensure the precise knowledge transfer. Moreover, conventional alignment-based approaches may be vulnerable to catastrophic overfitting regarding those less transferable regions (e.g. backgrounds) due to the accumulation of inaccurate localization results in the target domain. To remedy these issues, we first formulate DAOD as an open-set domain adaptation problem, in which the foregrounds and backgrounds are seen as the ``known classes'' and ``unknown class'' respectively. Accordingly, we propose a new and general framework for DAOD, named Foreground-aware Graph-based Relational Reasoning (FGRR), which incorporates graph structures into the detection pipeline to explicitly model the intra- and inter-domain foreground object relations on both pixel and semantic spaces, thereby endowing the DAOD model with the capability of relational reasoning beyond the popular alignment-based paradigm. The inter-domain visual and semantic correlations are hierarchically modeled via bipartite graph structures, and the intra-domain relations are encoded via graph attention mechanisms. Empirical results demonstrate that the proposed FGRR exceeds the state-of-the-art performance on four DAOD benchmarks.
翻译:目标对象检测( DAOD) 侧重于通过知识转让提高物体探测器的通用能力。 DAOD 近期的进展是试图通过细微的特性调整方法将适应过程的重点从全球转向地方,但全球和地方的调整方法都未能捕捉不同前景对象之间的地形关系,因为忽视了不同前景对象之间的明显依赖和相互作用以及域内和域内的相互作用。在此情况下,只寻求一五一对齐并不一定能确保准确的知识转移。此外,由于目标域的本地化结果累积不准确,常规的基于校正的处理办法可能很容易在较不易转移区域(例如背景)上出现灾难性的超标。为了解决这些问题,我们首先将DAOD设计方法作为开放的域适应问题,其中的地貌和背景分别被视为“已知的阶级”和“未知的阶级之间的相互作用。因此,我们为DAOD提出了一个新的和一般的框架,名为FEF-S-SO-O-S-SL-S-Relor-Relational-Recal real realation (FGRR),它将直图结构结构结构结构结构结构结构结构结构结构结构结构结构与直图结构结构结构结构结构结构结构结构结构结构与直观结构结构结构与直观关系与直观关系与直观关系,通过模型关系,通过模型与直观-直观-直观-直观-直观-直观/DADALODALODODODRDRDRDRDR- 的模型关系,通过直图-直线路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路。