This paper provides a comparison of current video content extraction tools with a focus on comparing commercial task-based machine learning services. Video intelligence (VIDINT) data has become a critical intelligence source in the past decade. The need for AI-based analytics and automation tools to extract and structure content from video has quickly become a priority for organizations needing to search, analyze and exploit video at scale. With rapid growth in machine learning technology, the maturity of machine transcription, machine translation, topic tagging, and object recognition tasks are improving at an exponential rate, breaking performance records in speed and accuracy as new applications evolve. Each section of this paper reviews and compares products, software resources and video analytics capabilities based on tasks relevant to extracting information from video with machine learning techniques.
翻译:本文件比较了目前的视频内容提取工具,重点是比较基于商业任务的机器学习服务。在过去十年中,视频情报(VIDINT)数据已成为一个重要的情报来源。需要基于AI的分析和自动化工具从视频中提取内容和结构内容,这很快成为需要大规模搜索、分析和利用视频的组织的一个优先事项。随着机器学习技术的迅速增长,机器抄录、机器翻译、专题标记和物体识别任务的成熟程度正在以指数速度提高,随着新应用的演变,业绩记录的速度和准确性都破碎了。本文每一节根据从机器学习技术的视频中提取信息的任务,对产品、软件资源和视频分析能力进行审查和比较。