Large annotated datasets are required to train segmentation networks. In medical imaging, it is often difficult, time consuming and expensive to create such datasets, and it may also be difficult to share these datasets with other researchers. Different AI models can today generate very realistic synthetic images, which can potentially be openly shared as they do not belong to specific persons. However, recent work has shown that using synthetic images for training deep networks often leads to worse performance compared to using real images. Here we demonstrate that using synthetic images and annotations from an ensemble of 10 GANs, instead of from a single GAN, increases the Dice score on real test images with 4.7 % to 14.0 % on specific classes.
翻译:在医学成像中,创建这类数据集往往困难、费时和费钱,而且与其他研究人员分享这些数据集也可能很困难。不同的AI模型今天可以产生非常现实的合成图像,这些图像可能公开分享,因为它们不属于特定的人。然而,最近的工作表明,使用合成图像培训深层网络往往比使用真实图像造成更差的性能。在这里,我们证明,使用合成图像和10个GAN组合的注释,而不是单一的GAN,将实际测试图像的骰子分数提高4.7%至14.0%,具体类别。