Ocean-going platforms are integrating high-resolution camera feeds for observation and navigation, producing a deluge of visual data. The volume and rate of this data collection can rapidly outpace researchers' abilities to process and analyze them. Recent advances in machine learning enable fast, sophisticated analysis of visual data, but have had limited success in the ocean due to lack of data set standardization, insufficient formatting, and aggregation of existing, expertly curated imagery for use by data scientists. To address this need, we have built FathomNet, a public platform that makes use of existing, expertly curated data. Initial efforts have leveraged MBARI's Video Annotation and Reference System and annotated deep sea video database, which has more than 7M annotations, 1M frame grabs, and 5k terms in the knowledgebase, with additional contributions by National Geographic Society (NGS) and NOAA's Office of Ocean Exploration and Research. FathomNet has over 160k localizations of 1.4k midwater and benthic classes, and contains more than 70k iconic and non-iconic views of marine animals, underwater equipment, debris, etc. We demonstrate how machine learning models trained on FathomNet data can be applied across different institutional video data, and enable automated acquisition and tracking of midwater animals using a remotely operated vehicle. As FathomNet continues to develop and incorporate more image data from other oceanographic community members, this effort will enable scientists, explorers, policymakers, storytellers, and the public to understand and care for our ocean.
翻译:海洋平台正在整合用于观测和导航的高分辨率摄像材料,生成大量视觉数据。这种数据收集的数量和速度可以迅速超过研究人员处理和分析这些数据的能力。最近机器学习的进展使得能够对视觉数据进行快速、精密的分析,但由于缺少数据集标准化、格式化不足以及现有专业整理图像的汇总,供数据科学家使用,海洋平台在海洋取得了有限的成功。为了满足这一需要,我们建立了FathomNet,这是一个公共平台,利用了现有、专业整理的数据。初步努力利用了MBARI视频注释和参考系统以及附加说明的深海视频数据库,该数据库拥有超过7M说明、1M框架抓取和5k项知识数据库,但由于缺乏数据集标准化、现有、专业整理的图像供数据科学家使用,因此,由于国家地理学会和诺阿海洋勘探和研究办公室的海洋勘探和研究办公室的额外贡献,在海洋网络中,我们建立了超过160k 的中层和底层水和底层级级的本地化,并包含了超过70k的海洋动物、水下层设备、碎片和附加的图象学观点,等等。我们利用了超过7M框架的海洋视频网络,在数据库中不断学习数据,我们将如何在数据库中不断进行数据学习。