Each year, underwater remotely operated vehicles (ROVs) collect thousands of hours of video of unexplored ocean habitats revealing a plethora of information regarding biodiversity on Earth. However, fully utilizing this information remains a challenge as proper annotations and analysis require trained scientists time, which is both limited and costly. To this end, we present a Dataset for Underwater Substrate and Invertebrate Analysis (DUSIA), a benchmark suite and growing large-scale dataset to train, validate, and test methods for temporally localizing four underwater substrates as well as temporally and spatially localizing 59 underwater invertebrate species. DUSIA currently includes over ten hours of footage across 25 videos captured in 1080p at 30 fps by an ROV following pre planned transects across the ocean floor near the Channel Islands of California. Each video includes annotations indicating the start and end times of substrates across the video in addition to counts of species of interest. Some frames are annotated with precise bounding box locations for invertebrate species of interest, as seen in Figure 1. To our knowledge, DUSIA is the first dataset of its kind for deep sea exploration, with video from a moving camera, that includes substrate annotations and invertebrate species that are present at significant depths where sunlight does not penetrate. Additionally, we present the novel context-driven object detector (CDD) where we use explicit substrate classification to influence an object detection network to simultaneously predict a substrate and species class influenced by that substrate. We also present a method for improving training on partially annotated bounding box frames. Finally, we offer a baseline method for automating the counting of invertebrate species of interest.
翻译:每年,水下遥控潜水器(ROVs)收集数千小时未经探索的海洋生境视频,显示有关地球上生物多样性的信息过多。然而,充分利用这一信息仍是一项挑战,因为适当的说明和分析需要经过培训的科学家时间,这种时间有限,费用也很高。为此,我们提供了一套水下亚底和脊椎分析数据集(DUSIA),一个基准套件,以及越来越多的大型数据集,用于培训、验证和测试4个水下基体以及59个水下无脊椎物种的时间定位和空间定位。DUSIA目前包括25个视频的1080pp所摄取的超过10个小时的视频,在1080pp所摄取的30英尺上方方位。我们同时展示了在加利福尼亚海峡群岛附近海底进行剖析的25小时以上。每个视频包含一个说明,显示视频的开始和结束时间。一些框架配有精确的无脊椎动物物种的框框位置,如图1所示,DUSSIA是其深海勘探的首个目标的数据集,在1080pps摄像中拍摄到30英尺的25摄像带。 我们从地面勘测到现在的底的底底底底底底底的摄测法,,我们用一个不断勘测法, 将显示一个新的摄像底底的深度的深度的深度的摄像头的摄像器, 。我们使用一个深度的深度的深度的深度的深度的深度勘测法, 将显示的深度的深度的深度的深度的深度的深度的深度的深度的深度的深度的深度的深度,我们的深度的深度测量测算法。