The re-identification (ReID) of individuals over a complex network of cameras is a challenging task, especially under real-world surveillance conditions. Several deep learning models have been proposed for visible-infrared (V-I) person ReID to recognize individuals from images captured using RGB and IR cameras. However, performance may decline considerably if RGB and IR images captured at test time are corrupted (e.g., noise, blur, and weather conditions). Although various data augmentation (DA) methods have been explored to improve the generalization capacity, these are not adapted for V-I person ReID. In this paper, a specialized DA strategy is proposed to address this multimodal setting. Given both the V and I modalities, this strategy allows to diminish the impact of corruption on the accuracy of deep person ReID models. Corruption may be modality-specific, and an additional modality often provides complementary information. Our multimodal DA strategy is designed specifically to encourage modality collaboration and reinforce generalization capability. For instance, punctual masking of modalities forces the model to select the informative modality. Local DA is also explored for advanced selection of features within and among modalities. The impact of training baseline fusion models for V-I person ReID using the proposed multimodal DA strategy is assessed on corrupted versions of the SYSU-MM01, RegDB, and ThermalWORLD datasets in terms of complexity and efficiency. Results indicate that using our strategy provides V-I ReID models the ability to exploit both shared and individual modality knowledge so they can outperform models trained with no or unimodal DA. GitHub code: https://github.com/art2611/ML-MDA.
翻译:在复杂的照相机网络上对个人进行重新定位(ReID)是一项具有挑战性的任务,特别是在现实世界的监测条件下。已经为可见红外线(V-I)人员ReID提出了若干深层学习模式,以识别使用RGB和IR相机拍摄的图像中的个人;然而,如果测试时捕获的RGB和IR图像(例如噪音、模糊和天气条件)受到腐蚀,业绩可能会大大下降。虽然已经探索了各种数据增强(DA)方法来提高一般化能力,但这些方法并不适应V-I人员ReID。本文提出了针对这种多式联运设置的专门DA战略。鉴于V-I模式和I模式,该战略有助于减少腐败对深重个人ReID模型准确性的影响。腐败可能是特定模式,而另一种额外模式往往提供补充信息。我们的多式联运DA战略是鼓励模式合作和加强一般化能力。举例说,及时掩盖模式迫使选择信息模式。当地DADA还探讨如何在内部和各种模式中进一步选择非模式。根据经过培训的DAFR-R-RMR战略评估了标准化模型。