Data has become the most valuable resource in today's world. With the massive proliferation of data-driven algorithms, such as deep learning-based approaches, the availability of data is of great interest. In this context, high-quality training, validation and testing datasets are particularly needed. Volumetric data is a very important resource in medicine, as it ranges from disease diagnoses to therapy monitoring. When the dataset is sufficient, models can be trained to help doctors with these tasks. Unfortunately, there are scenarios and applications where large amounts of data is unavailable. For example, in the medical field, rare diseases and privacy issues can lead to restricted data availability. In non-medical fields, the high cost of obtaining a sufficient amount of high-quality data can also be a concern. A solution to these problems can be the generation of synthetic data to perform data augmentation in combination with other more traditional methods of data augmentation. Therefore, most of the publications on 3D Generative Adversarial Networks (GANs) are within the medical domain. The existence of mechanisms to generate realistic synthetic data is a good asset to overcome this challenge, especially in healthcare, as the data must be of good quality and close to reality, i.e. realistic, and without privacy issues. In this review, we provide a summary of works that generate realistic 3D synthetic data using GANs. We therefore outline GAN-based methods in these areas with common architectures, advantages and disadvantages. We present a novel taxonomy, evaluations, challenges and research opportunities to provide a holistic overview of the current state of GANs in medicine and other fields.
翻译:数据已成为当今世界最宝贵的资源。由于数据驱动的算法,如深层次的学习方法的大规模扩散,数据的供应极受关注。在这方面,特别需要高质量的培训、验证和测试数据集。量量数据是医学中非常重要的资源,因为它从疾病诊断到治疗监测不等。因此,当数据集充足时,可以对模型进行培训,以帮助医生完成这些任务。不幸的是,有些假设和应用程序缺乏大量数据。例如,在医疗领域,罕见的疾病和隐私问题可能导致数据获取受限。在非医疗领域,获得足够数量的高质量数据的成本很高,这也是一个令人关切的问题。这些问题的解决方法可以是合成数据生成数据,以便与其他更传统的数据增强方法相结合,从而增加数据。因此,在3D Geneantarial Aversarial网络(GANs)上的大多数出版物都属于医学领域。因此,产生现实的合成数据机制的存在是克服这一挑战的好资产,特别是在医疗保健领域,特别是保健领域,获得足够数量的高质量数据的高成本成本也可能是一个问题。这些问题的解决方案可能是合成数据生成的GAN大纲,我们必须利用现实的高质量和接近的GAN领域提供一种数据。