Recent advancements in deep learning methods bring computer-assistance a step closer to fulfilling promises of safer surgical procedures. However, the generalizability of such methods is often dependent on training on diverse datasets from multiple medical institutions, which is a restrictive requirement considering the sensitive nature of medical data. Recently proposed collaborative learning methods such as Federated Learning (FL) allow for training on remote datasets without the need to explicitly share data. Even so, data annotation still represents a bottleneck, particularly in medicine and surgery where clinical expertise is often required. With these constraints in mind, we propose FedCy, a federated semi-supervised learning (FSSL) method that combines FL and self-supervised learning to exploit a decentralized dataset of both labeled and unlabeled videos, thereby improving performance on the task of surgical phase recognition. By leveraging temporal patterns in the labeled data, FedCy helps guide unsupervised training on unlabeled data towards learning task-specific features for phase recognition. We demonstrate significant performance gains over state-of-the-art FSSL methods on the task of automatic recognition of surgical phases using a newly collected multi-institutional dataset of laparoscopic cholecystectomy videos. Furthermore, we demonstrate that our approach also learns more generalizable features when tested on data from an unseen domain.
翻译:最近深层次学习方法的进展使计算机援助更接近于实现更安全的外科手术程序的承诺。然而,这类方法的普及性往往取决于对多个医疗机构不同数据集的培训,考虑到医疗数据的敏感性,这是一个限制性要求。最近提出的合作学习方法,如联邦学习联合会(FL)允许对远程数据集进行培训,而无需明确分享数据。即使如此,数据注释仍然代表着瓶颈,特别是在往往需要临床专门知识的医学和外科手术中。我们考虑到这些制约因素,建议FedCy采用一种联合的半监督学习(FSSL)方法,将FL和自我监督学习结合起来,以利用标签和无标签视频的分散数据集,从而改进外科阶段识别任务的绩效。通过利用标签数据的时间模式,FedCy帮助指导关于无标签数据的非统一培训,以学习特定任务的特点进行阶段识别。我们用最新、最先进的FSSLSL方法,即将FLSL和自我监督的学习结合起来,从而利用新收集的多层次数据系统,从我们新收集的多层次的视野中,从新收集的多层次的普通数据系统。