Supervised training of neural networks requires large, diverse and well annotated data sets. In the medical field, this is often difficult to achieve due to constraints in time, expert knowledge and prevalence of an event. Artificial data augmentation can help to prevent overfitting and improve the detection of rare events as well as overall performance. However, most augmentation techniques use purely spatial transformations, which are not sufficient for video data with temporal correlations. In this paper, we present a novel methodology for workflow augmentation and demonstrate its benefit for event recognition in cataract surgery. The proposed approach increases the frequency of event alternation by creating artificial videos. The original video is split into event segments and a workflow graph is extracted from the original annotations. Finally, the segments are assembled into new videos based on the workflow graph. Compared to the original videos, the frequency of event alternation in the augmented cataract surgery videos increased by 26%. Further, a 3% higher classification accuracy and a 7.8% higher precision was achieved compared to a state-of-the-art approach. Our approach is particularly helpful to increase the occurrence of rare but important events and can be applied to a large variety of use cases.
翻译:对神经网络的监督培训需要大量、多样和有良好说明的数据集。在医疗领域,由于时间限制、专家知识和事件流行程度的限制,这往往难以实现。人工数据增强有助于防止对稀有事件的探测和总体性能的过度和改进。然而,大多数增强技术使用纯空间变换,这些变换不足以与时间相关性的视频数据。在本文中,我们介绍了工作流程增强的新方法,并展示了其在白内障外科手术中事件识别的好处。拟议方法通过制作人工视频增加事件变换频率。原始视频分为事件片段,从原始说明中提取工作流程图。最后,根据工作流程图将部分组装成新的视频。与原始视频相比,扩大的白内障外科手术视频中事件变换频率增加了26%。此外,与最新方法相比,分类精确度提高了3%,精确度提高了7.8%。我们的方法特别有助于增加稀有但重要的事件的发生,可以应用于大量使用的案例。