For many practical problems and applications, it is not feasible to create a vast and accurately labeled dataset, which restricts the application of deep learning in many areas. Semi-supervised learning algorithms intend to improve performance by also leveraging unlabeled data. This is very valuable for 2D-pose estimation task where data labeling requires substantial time and is subject to noise. This work aims to investigate if semi-supervised learning techniques can achieve acceptable performance level that makes using these algorithms during training justifiable. To this end, a lightweight network architecture is introduced and mean teacher, virtual adversarial training and pseudo-labeling algorithms are evaluated on 2D-pose estimation for surgical instruments. For the applicability of pseudo-labelling algorithm, we propose a novel confidence measure, total variation. Experimental results show that utilization of semi-supervised learning improves the performance on unseen geometries drastically while maintaining high accuracy for seen geometries. For RMIT benchmark, our lightweight architecture outperforms state-of-the-art with supervised learning. For Endovis benchmark, pseudo-labelling algorithm improves the supervised baseline achieving the new state-of-the-art performance.
翻译:对于许多实际问题和应用程序,建立广泛而准确的标签数据集是不可行的,因为这限制了许多领域深层学习的应用。半监督的学习算法打算通过同时利用无标签数据来改进业绩。这对于数据标签需要大量时间并受到噪音影响的 2D 定位估计任务非常宝贵。这项工作旨在调查半监督的学习技术是否能够达到可接受的性能水平,从而使在培训期间使用这些算法成为合理的。为此,引入了轻量级网络结构,对2D 应用估计外科仪器进行了平均的师资、虚拟对抗培训和假标签算法。关于假标签算法的适用性,我们提出了新的信任度尺度、全面变异性。实验结果显示,使用半监督的学习可以极大地改善看不见的地理气象学的性能,同时保持高精确度。关于REMT基准,我们的轻量级结构在有监督的学习方面优于最新水平。关于Endovis的基准,假标签算法改进了监督的基线,以达到新的状态性能。