Medical image classification is a critical task in healthcare, enabling accurate and timely diagnosis. However, deploying deep learning models on resource-constrained edge devices presents significant challenges due to computational and memory limitations. This research investigates a resource-efficient approach to medical image classification by employing model quantization techniques. Quantization reduces the precision of model parameters and activations, significantly lowering computational overhead and memory requirements without sacrificing classification accuracy. The study focuses on the optimization of quantization-aware training (QAT) and post-training quantization (PTQ) methods tailored for edge devices, analyzing their impact on model performance across medical imaging datasets. Experimental results demonstrate that quantized models achieve substantial reductions in model size and inference latency, enabling real-time processing on edge hardware while maintaining clinically acceptable diagnostic accuracy. This work provides a practical pathway for deploying AI-driven medical diagnostics in remote and resource-limited settings, enhancing the accessibility and scalability of healthcare technologies.
翻译:医学图像分类是医疗健康领域的一项关键任务,能够实现准确及时的诊断。然而,由于计算和内存资源的限制,在资源受限的边缘设备上部署深度学习模型面临重大挑战。本研究通过采用模型量化技术,探索了一种资源高效的医学图像分类方法。量化通过降低模型参数和激活值的精度,在基本不牺牲分类准确性的前提下,显著降低了计算开销和内存需求。本研究重点优化了针对边缘设备定制的量化感知训练(QAT)与训练后量化(PTQ)方法,并分析了它们在医学影像数据集上对模型性能的影响。实验结果表明,量化模型在模型大小和推理延迟方面实现了显著降低,能够在边缘硬件上实现实时处理,同时保持临床可接受的诊断准确性。这项工作为在偏远和资源有限的环境中部署人工智能驱动的医学诊断提供了实用路径,从而提升了医疗技术的可及性和可扩展性。