The performance of supervised deep learning algorithms depends significantly on the scale, quality and diversity of the data used for their training. Collecting and manually annotating large amount of data can be both time-consuming and costly tasks to perform. In the case of tasks related to visual human-centric perception, the collection and distribution of such data may also face restrictions due to legislation regarding privacy. In addition, the design and testing of complex systems, e.g., robots, which often employ deep learning-based perception models, may face severe difficulties as even state-of-the-art methods trained on real and large-scale datasets cannot always perform adequately due to not having been adapted to the visual differences between the virtual and the real world data. As an attempt to tackle and mitigate the effect of these issues, we present a method that automatically generates realistic synthetic data with annotations for a) person detection, b) face recognition, and c) human pose estimation. The proposed method takes as input real background images and populates them with human figures in various poses. Instead of using hand-made 3D human models, we propose the use of models generated through deep learning methods, further reducing the dataset creation costs, while maintaining a high level of realism. In addition, we provide open-source and easy to use tools that implement the proposed pipeline, allowing for generating highly-realistic synthetic datasets for a variety of tasks. A benchmarking and evaluation in the corresponding tasks shows that synthetic data can be effectively used as a supplement to real data.
翻译:监督深层学习算法的绩效在很大程度上取决于其培训所用数据的规模、质量和多样性。收集和人工说明大量数据可能既费时又费钱。在与视觉人类中心观念有关的任务中,这些数据的收集和传播也可能由于隐私方面的立法而面临限制。此外,复杂系统的设计和测试,例如往往采用深层次学习的认知模型的机器人,可能面临严重困难,因为即使是在实际和大规模数据集方面经过培训的最先进的方法,由于没有适应虚拟和真实世界数据之间的视觉差异,因此并不总是能够充分发挥作用。为了设法解决和减轻这些问题的影响,我们提出的方法是自动生成现实的合成数据,并附有个人检测说明,(b)面部识别,和(c)人造估计。拟议方法可以作为投入真实的背景图像,并以各种人造数字为背景。我们提议使用模型,而不是使用手制的3D人造模型,因此无法始终充分发挥作用,因为没有适应虚拟世界数据与真实世界数据之间的视觉差异差异。我们提出一种方法,为了解决和减轻这些问题的影响,我们提出了一个方法,可以自动生成现实的合成数据,同时进一步提供高层次数据生成工具,同时提供高层次的模型。