Autonomous Underwater Vehicles (AUVs) conduct regular visual surveys of marine environments to characterise and monitor the composition and diversity of the benthos. The use of machine learning classifiers for this task is limited by the low numbers of annotations available and the many fine-grained classes involved. In addition to these challenges, there are domain shifts between image sets acquired during different AUV surveys due to changes in camera systems, imaging altitude, illumination and water column properties leading to a drop in classification performance for images from a different survey where some or all these elements may have changed. This paper proposes a framework to improve the performance of a benthic morphospecies classifier when used to classify images from a different survey compared to the training data. We adapt the SymmNet state-of-the-art Unsupervised Domain Adaptation method with an efficient bilinear pooling layer and image scaling to normalise spatial resolution, and show improved classification accuracy. We test our approach on two datasets with images from AUV surveys with different imaging payloads and locations. The results show that generic domain adaptation can be enhanced to produce a significant increase in accuracy for images from an AUV survey that differs from the training images.
翻译:自主水下车(AUV)定期进行视觉调查以表征和监测海洋环境的底栖动物的组成和多样性。使用机器学习分类器进行此任务受到可用注释数量的限制以及涉及许多细粒度类别的影响。除了这些挑战之外,由于相机系统、拍摄高度、照明和水柱特性的变化,不同AUV调查期间获取的图像集之间存在域转移,导致在对来自不同调查的图像进行分类时,分类性能会降低,其中一些或全部这些元素可能已经发生改变。本文提出了一种框架,用于在对来自不同调查的图像进行分类时,提高底栖形态物种分类器的性能。我们使用高效的双线性池化层和图像缩放来适应SymmNet最先进的无监督域适应方法,从而提高分类准确性。我们在两个数据集上测试了我们的方法,这些数据集包含了具有不同成像载荷和位置的AUV调查图像。结果表明,通用域适应可以被改进以产生对于来自不同调查图像的分类准确性显著提高。