Real-world images used for training machine learning algorithms are often unstructured and inconsistent. The process of analysing and tagging these images can be costly and error prone (also availability, gaps and legal conundrums). However, as we demonstrate in this article, the potential to generate accurate graphical images that are indistinguishable from real-world sources has a multitude of benefits in machine learning paradigms. One such example of this is football data from broadcast services (television and other streaming media sources). The football games are usually recorded from multiple sources (cameras and phones) and resolutions, not to mention, occlusion of visual details and other artefacts (like blurring, weathering and lighting conditions) which make it difficult to accurately identify features. We demonstrate an approach which is able to overcome these limitations using generated tagged and structured images. The generated images are able to simulate a variety views and conditions (including noise and blurring) which may only occur sporadically in real-world data and make it difficult for machine learning algorithm to 'cope' with these unforeseen problems in real-data. This approach enables us to rapidly train and prepare a robust solution that accurately extracts features (e.g., spacial locations, markers on the pitch, player positions, ball location and camera FOV) from real-world football match sources for analytical purposes.
翻译:用于培训机器学习算法的实时世界图像往往没有结构,而且不一致。分析和标记这些图像的过程可能成本高,而且容易出错(还有可用性、差距和法律难题)。然而,正如我们在本篇文章中显示的那样,生成与现实世界来源无法区分的准确图形图像的潜力在机器学习范式方面有许多好处。这方面的一个例子就是广播服务(电视和其他流媒体来源)提供的足球数据。足球运动通常从多种来源(摄像头和电话)和分辨率记录下来,更不用提的是,隐蔽视觉细节和其他工艺(如模糊、天气和照明条件),难以准确识别特征。我们展示了一种能够利用制作的有标签和结构的图像克服这些局限性的方法。产生的图像能够模拟各种观点和条件(包括噪音和模糊),而这些在现实世界数据中可能只是偶尔出现,而且机器学习算法很难与这些不可预见的问题相匹配。这一方法使我们能够快速地培训和准备一种强有力的解决方案,从真实的地球空间定位、精确的定位定位(定位)定位和定位(定位)的磁标和定位。