Disentangled representations can be useful in many downstream tasks, help to make deep learning models more interpretable, and allow for control over features of synthetically generated images that can be useful in training other models that require a large number of labelled or unlabelled data. Recently, flow-based generative models have been proposed to generate realistic images by directly modeling the data distribution with invertible functions. In this work, we propose a new flow-based generative model framework, named GLOWin, that is end-to-end invertible and able to learn disentangled representations. Feature disentanglement is achieved by factorizing the latent space into components such that each component learns the representation for one generative factor. Comprehensive experiments have been conducted to evaluate the proposed method on a public brain tumor MR dataset. Quantitative and qualitative results suggest that the proposed method is effective in disentangling the features from complex medical images.
翻译:在许多下游任务中,分解的表达方式可能有用,有助于使深层次学习模型更易于解释,并能够控制合成生成图像的特征,这些特征可用于培训需要大量贴标签或未贴标签数据的其他模型。最近,提出了基于流动的基因化模型,通过直接模拟数据分布和可视功能来产生现实的图像。在这项工作中,我们提出了一个新的以流动为基础的基因化模型框架,名为GLOWin,这是最终到最终的不可翻转的,能够学习分解的表达方式。通过将潜在空间纳入各个组成部分,使每个组成部分了解一个基因化要素的表示方式,从而实现特征分解。已经进行了全面实验,以评价公共脑瘤MR数据集的拟议方法。定量和定性结果表明,拟议的方法对于从复杂的医学图像中分离特征是有效的。