Recently, the use of synthetic datasets based on game engines has been shown to improve the performance of several tasks in computer vision. However, these datasets are typically only appropriate for the specific domains depicted in computer games, such as urban scenes involving vehicles and people. In this paper, we present an approach to generate synthetic datasets for object counting for any domain without the need for photo-realistic techniques manually generated by expensive teams of 3D artists. We introduce a domain randomization approach for object counting based on synthetic datasets that are quick and inexpensive to generate. We deliberately avoid photorealism and drastically increase the variability of the dataset, producing images with random textures and 3D transformations, which improves generalization. Experiments show that our method facilitates good performance on various real word object counting datasets for multiple domains: people, vehicles, penguins, and fruit. The source code is available at: https://github.com/enric1994/dr4oc
翻译:最近,以游戏引擎为基础的合成数据集的使用已证明可以改善计算机视觉中若干任务的业绩。然而,这些数据集通常只适用于计算机游戏中描述的具体领域,例如涉及车辆和人员的城市场景。在本文中,我们提出了一个方法,用于生成合成数据集,用于计算任何领域的天体,而不需要由3D艺术家的昂贵团队手工制作的摄影现实技术。我们采用了基于快速和廉价生成的合成数据集进行天体计的域随机化方法。我们有意避免光现实主义,并大大增加数据集的变异性,用随机的纹理和3D变形生成图像,从而改进一般化。实验表明,我们的方法有利于在多种领域(人、车辆、企鹅和水果)的各种真字天体物体计数据集上取得良好的性能。源代码见:https://github.com/enric1994/dr4oc。