The performance of existing underwater object detection methods degrades seriously when facing domain shift caused by complicated underwater environments. Due to the limitation of the number of domains in the dataset, deep detectors easily memorize a few seen domains, which leads to low generalization ability. There are two common ideas to improve the domain generalization performance. First, it can be inferred that the detector trained on as many domains as possible is domain-invariant. Second, for the images with the same semantic content in different domains, their hidden features should be equivalent. This paper further excavates these two ideas and proposes a domain generalization framework (named DMC) that learns how to generalize across domains from Domain Mixup and Contrastive Learning. First, based on the formation of underwater images, an image in an underwater environment is the linear transformation of another underwater environment. Thus, a style transfer model, which outputs a linear transformation matrix instead of the whole image, is proposed to transform images from one source domain to another, enriching the domain diversity of the training data. Second, mixup operation interpolates different domains on the feature level, sampling new domains on the domain manifold. Third, contrastive loss is selectively applied to features from different domains to force the model to learn domain invariant features but retain the discriminative capacity. With our method, detectors will be robust to domain shift. Also, a domain generalization benchmark S-UODAC2020 for detection is set up to measure the performance of our method. Comprehensive experiments on S-UODAC2020 and two object recognition benchmarks (PACS and VLCS) demonstrate that the proposed method is able to learn domain-invariant representations, and outperforms other domain generalization methods.
翻译:现有水下物体探测方法的性能在面临复杂的水下环境造成的域变时会严重退化。 由于数据集域数有限, 深探测器很容易对几个可见域进行记忆化, 从而导致一般化能力低。 有两种共同的想法可以改进域的概括性性性性能。 首先, 可以推断, 在尽可能多的域上受过训练的探测器是域内变异。 其次, 对于不同域内具有相同语义内容的图像, 它们隐藏的特性应该相等。 本文进一步挖掘了这两个想法, 并提议了一个域化框架( 名为 DMC ), 以学习如何在多曼混集和对比性学习的域间域间域间化。 首先, 根据水下图像的形成, 水下环境中的图像是另一个水下环境的线性变。 因此, 风格转换模型, 产生一个线性变矩阵, 而不是整个图像, 提议将图像从一个来源域向另一个域内转成一个20, 丰富培训基准的域多样化。 其次, 将不同对象的域间化操作对不同域域域域的域间化, 在域域域域域内取样内, 将显示新的域内域内域内测算, 性性变换成另一个的域内, 方法 将Sla变为一般的域内, 方法 将S 。 将S d 。 将S dro化法 系统制为一般的域域域内 方法 将进行为一般的域 将 。