Strategies that include the generation of synthetic data are beginning to be viable as obtaining real data can be logistically complicated, very expensive or slow. Not only the capture of the data can lead to complications, but also its annotation. To achieve high-fidelity data for training intelligent systems, we have built a 3D scenario and set-up to resemble reality as closely as possible. With our approach, it is possible to configure and vary parameters to add randomness to the scene and, in this way, allow variation in data, which is so important in the construction of a dataset. Besides, the annotation task is already included in the data generation exercise, rather than being a post-capture task, which can save a lot of resources. We present the process and concept of synthetic data generation in an automotive context, specifically for driver and passenger monitoring purposes, as an alternative to real data capturing.
翻译:包括合成数据生成在内的战略开始变得可行,因为获取真实数据在后勤上可能非常复杂、费用昂贵或缓慢。不仅数据捕获可能导致并发症,而且其注释性也会发生。为了为培训智能系统而获得高贞洁度数据,我们建立了一个三维假设和设置,以尽可能接近现实。通过我们的方法,可以配置和改变参数,增加现场随机性,从而允许数据的变化,这对构建数据集非常重要。此外,在数据生成工作中已经包括了说明性任务,而不是事后任务,这可以节省大量资源。我们介绍了在汽车环境下合成数据生成的过程和概念,特别是用于司机和乘客监测目的,作为真实数据采集的替代方法。