Estimating depth from endoscopic images is a pre-requisite for a wide set of AI-assisted technologies, namely accurate localization, measurement of tumors, or identification of non-inspected areas. As the domain specificity of colonoscopies -- a deformable low-texture environment with fluids, poor lighting conditions and abrupt sensor motions -- pose challenges to multi-view approaches, single-view depth learning stands out as a promising line of research. In this paper, we explore for the first time Bayesian deep networks for single-view depth estimation in colonoscopies. Their uncertainty quantification offers great potential for such a critical application area. Our specific contribution is two-fold: 1) an exhaustive analysis of Bayesian deep networks for depth estimation in three different datasets, highlighting challenges and conclusions regarding synthetic-to-real domain changes and supervised vs. self-supervised methods; and 2) a novel teacher-student approach to deep depth learning that takes into account the teacher uncertainty.
翻译:从内窥镜图像中估计深度是一套广泛的人工智能辅助技术的前提,即准确定位、肿瘤测量或确定未受检查地区。作为结肠镜的域特性 -- -- 一种可变形的低脂环境,有液体、照明条件差和突然感官运动 -- -- 对多种观点方法构成挑战,单视深度学习是一线有希望的研究。在本文中,我们首次探索了巴耶斯深海网络,对结肠镜进行单视深度估计,其不确定性的量化为这样一个关键应用领域提供了巨大的潜力。我们的具体贡献有两重:(1) 对巴耶斯深海网络进行详尽的分析,以在三个不同的数据集中进行深度估计,突出关于合成到现实领域变化的挑战和结论,并监督与自我监督的方法;(2) 一种考虑到教师不确定性的深层学习新师范方法。