Data augmentation has proven its usefulness to improve model generalization and performance. While it is commonly applied in computer vision application when it comes to multi-view systems, it is rarely used. Indeed geometric data augmentation can break the alignment among views. This is problematic since multi-view data tend to be scarce and it is expensive to annotate. In this work we propose to solve this issue by introducing a new multi-view data augmentation pipeline that preserves alignment among views. Additionally to traditional augmentation of the input image we also propose a second level of augmentation applied directly at the scene level. When combined with our simple multi-view detection model, our two-level augmentation pipeline outperforms all existing baselines by a significant margin on the two main multi-view multi-person detection datasets WILDTRACK and MultiviewX.
翻译:数据增强已证明有助于改进模型的概括化和性能。虽然在多视图系统中计算机的视觉应用中通常使用,但很少使用。事实上几何数据增强可以打破各种视图之间的对齐。由于多视图数据往往稀缺,因此存在问题,注释费用昂贵。在这项工作中,我们提议采用新的多视图数据增强管道来解决这个问题,从而保持各种视图之间的对齐。除了传统地扩大输入图像外,我们还提议在现场一级直接应用第二层次的增强。在与我们简单的多视图探测模型相结合时,我们两级的增强管道在两种主要的多视图多视图多人探测数据集WILLDTRACK和多视图X上大大超过所有现有基线。