Over the years, the paradigm of medical image analysis has shifted from manual expertise to automated systems, often using deep learning (DL) systems. The performance of deep learning algorithms is highly dependent on data quality. Particularly for the medical domain, it is an important aspect as medical data is very sensitive to quality and poor quality can lead to misdiagnosis. To improve the diagnostic performance, research has been done both in complex DL architectures and in improving data quality using dataset dependent static hyperparameters. However, the performance is still constrained due to data quality and overfitting of hyperparameters to a specific dataset. To overcome these issues, this paper proposes random data augmentation based enhancement. The main objective is to develop a generalized, data-independent and computationally efficient enhancement approach to improve medical data quality for DL. The quality is enhanced by improving the brightness and contrast of images. In contrast to the existing methods, our method generates enhancement hyperparameters randomly within a defined range, which makes it robust and prevents overfitting to a specific dataset. To evaluate the generalization of the proposed method, we use four medical datasets and compare its performance with state-of-the-art methods for both classification and segmentation tasks. For grayscale imagery, experiments have been performed with: COVID-19 chest X-ray, KiTS19, and for RGB imagery with: LC25000 datasets. Experimental results demonstrate that with the proposed enhancement methodology, DL architectures outperform other existing methods. Our code is publicly available at: https://github.com/aleemsidra/Augmentation-Based-Generalized-Enhancement
翻译:多年来,医学图像分析的范式已经从人工专业知识转向自动化系统,常常使用深层次学习(DL)系统。深层次学习算法的性能高度取决于数据质量。特别是医学领域,这是一个重要方面,因为医学数据对质量非常敏感,质量差会导致诊断错误。为了改进诊断性能,在复杂的DL结构中进行了研究,并且利用数据数据集依赖的静态超参数改进了数据质量。然而,由于数据质量和超分光度仪过度配置到特定的数据集,业绩仍然受到限制。为了克服这些问题,本文建议随机增加数据。主要目的是开发一个通用的、数据独立和计算高效的增强方法,以提高DL的医学数据质量。通过改善图像的亮度和对比,我们的方法与现有方法不同,我们的方法在确定的范围内随机增加超分光度,这使其强大,防止超分光度调整到特定的数据集。为了评估拟议方法的概括性化,我们使用四种医学数据升级/C级码来增强数据。