3D reconstruction is a useful tool for surgical planning and guidance. However, the lack of available medical data stunts research and development in this field, as supervised deep learning methods for accurate disparity estimation rely heavily on large datasets containing ground truth information. Alternative approaches to supervision have been explored, such as self-supervision, which can reduce or remove entirely the need for ground truth. However, no proposed alternatives have demonstrated performance capabilities close to what would be expected from a supervised setup. This work aims to alleviate this issue. In this paper, we investigate the learning of structured light projections to enhance the development of direct disparity estimation networks. We show for the first time that it is possible to accurately learn the projection of structured light on a scene, implicitly learning disparity. Secondly, we \textcolor{black}{explore the use of a multi task learning (MTL) framework for the joint training of structured light and disparity. We present results which show that MTL with structured light improves disparity training; without increasing the number of model parameters. Our MTL setup outperformed the single task learning (STL) network in every validation test. Notably, in the medical generalisation test, the STL error was 1.4 times worse than that of the best MTL performance. The benefit of using MTL is emphasised when the training data is limited.} A dataset containing stereoscopic images, disparity maps and structured light projections on medical phantoms and ex vivo tissue was created for evaluation together with virtual scenes. This dataset will be made publicly available in the future.
翻译:3D重建是外科手术规划和指导的有用工具。然而,缺乏这方面的现有医疗数据特效研究与开发,因为准确差异估算的深层次学习方法在很大程度上依赖于包含地面真相信息的大型数据集。探索了替代监督方法,如自我监督,这可以减少或完全消除对地面真相的需求。然而,没有任何拟议的替代方法显示出接近监督架构所预期的性能。这项工作旨在缓解这一问题。在本文件中,我们调查了为改进直接差异估计网络的发展而进行结构化的光学预测的学习。我们第一次显示,能够准确了解现场结构化光学的预测,隐含着学习差异。第二,我们探索了自我监督的替代方法,如自我监督,可以减少或完全消除对地面真相的需要。然而,我们提出的结果显示,MTL的性能能力接近于一个有条理的光线培训,而没有增加模型参数的数量。我们的MTL设置了未来任务学习网络在每次校验测试中都能够准确了解一个场景结构化的光线,隐含着学习差异。第二,我们利用有限的前期数据测试中最差的STMTL数据测试是用于最差的数据。