Visibility underwater is challenging, and degrades as the distance between the subject and camera increases, making vision tasks in the forward-looking direction more difficult. We have collected underwater forward-looking stereo-vision and visual-inertial image sets in the Mediterranean and Red Sea. To our knowledge there are no other public datasets in the underwater environment acquired with this camera-sensor orientation published with ground-truth. These datasets are critical for the development of several underwater applications, including obstacle avoidance, visual odometry, 3D tracking, Simultaneous Localization and Mapping (SLAM) and depth estimation. The stereo datasets include synchronized stereo images in dynamic underwater environments with objects of known-size. The visual-inertial datasets contain monocular images and IMU measurements, aligned with millisecond resolution timestamps and objects of known size which were placed in the scene. Both sensor configurations allow for scale estimation, with the calibrated baseline in the stereo setup and the IMU in the visual-inertial setup. Ground truth depth maps were created offline for both dataset types using photogrammetry. The ground truth is validated with multiple known measurements placed throughout the imaged environment. There are 5 stereo and 8 visual-inertial datasets in total, each containing thousands of images, with a range of different underwater visibility and ambient light conditions, natural and man-made structures and dynamic camera motions. The forward-looking orientation of the camera makes these datasets unique and ideal for testing underwater obstacle-avoidance algorithms and for navigation close to the seafloor in dynamic environments. With our datasets, we hope to encourage the advancement of autonomous functionality for underwater vehicles in dynamic and/or shallow water environments.
翻译:在地中海和红海,我们收集了水下前瞻性的立体影像和视觉内皮图像。 据我们所知,水下环境没有其他公共数据集,通过以地面真相发布摄像传感器和已知大小的物体而获得。这两个数据集对于开发几个水下应用程序至关重要,这些应用程序包括:障碍避免、视觉奥氏测量、3D跟踪、同步地平和映像(SLAM)和深度估计。立体数据集包括动态水下环境中的同步立体直线图像以及已知大小的视觉内观图像。视觉内层数据集包含单色图像和IMU测量数据,与在现场公布的毫秒分辨率定时标点和已知大小的物体相一致。这两个传感器配置都允许进行规模估算,在立体立立立的校准基线和视觉内置的IMU值。我们为离线的地面深度地图,在动态水下绘制了具有已知尺寸的直线直线直径直径直径直线图像,在地面图像中测量了各种直径直径直径的图像。</s>