Self-supervised learning (SSL) methods are enabling an increasing number of deep learning models to be trained on image datasets in domains where labels are difficult to obtain. These methods, however, struggle to scale to the high resolution of medical imaging datasets, where they are critical for achieving good generalization on label-scarce medical image datasets. In this work, we propose the Histopathology DatasetGAN (HDGAN) framework, an extension of the DatasetGAN semi-supervised framework for image generation and segmentation that scales well to large-resolution histopathology images. We make several adaptations from the original framework, including updating the generative backbone, selectively extracting latent features from the generator, and switching to memory-mapped arrays. These changes reduce the memory consumption of the framework, improving its applicability to medical imaging domains. We evaluate HDGAN on a thrombotic microangiopathy high-resolution tile dataset, demonstrating strong performance on the high-resolution image-annotation generation task. We hope that this work enables more application of deep learning models to medical datasets, in addition to encouraging more exploration of self-supervised frameworks within the medical imaging domain.
翻译:自我监督的学习(SSL)方法使越来越多的深层次学习模型能够就标签难以获得的领域的图像数据集接受培训。但是,这些方法努力推广到医疗成像数据集的高分辨率分辨率,这对于在标签破碎的医学成像数据集上实现良好的概括化至关重要。在这项工作中,我们提议了“历史病理学数据系统”框架(HDGAN),这是数据系统GAN半监督的图像生成和分解框架的延伸,它与大分辨率的病理学图像相匹配。我们从最初的框架中做了一些调整,包括更新基因支柱,有选择地从发电机中提取潜在特征,并转换到存储式阵列。这些改变减少了框架的记忆消耗,提高了框架对医疗成像域的可适用性。我们用一个血栓式微分辨率高分辨率的显像仪数据集对HDGAN进行了评估,显示了高分辨率成像分辨图的强性。我们希望这项工作能够在医学成像框架内更多地应用深学习模型进行医学成像,同时鼓励进行自我探索。