Colorectal cancer (CRC) is a leading worldwide cause of cancer-related mortality, and the role of prompt precise detection is of paramount interest in improving patient outcomes. Conventional diagnostic methods such as colonoscopy and histological examination routinely exhibit subjectivity, are extremely time-consuming, and are susceptible to variation. Through the development of digital pathology, deep learning algorithms have become a powerful approach in enhancing diagnostic precision and efficiency. In our work, we proposed a convolutional neural network architecture named MSRANetV2, specially optimized for the classification of colorectal tissue images. The model employs a ResNet50V2 backbone, extended with residual attention mechanisms and squeeze-and-excitation (SE) blocks, to extract deep semantic and fine-grained spatial features. With channel alignment and upsampling operations, MSRANetV2 effectively fuses multi-scale representations, thereby enhancing the robustness of the classification. We evaluated our model on a five-fold stratified cross-validation strategy on two publicly available datasets: CRC-VAL-HE-7K and NCT-CRC-HE-100K. The proposed model achieved remarkable average Precision, recall, F1-score, AUC, and test accuracy were 0.9884 plus-minus 0.0151, 0.9900 plus-minus 0.0151, 0.9900 plus-minus 0.0145, 0.9999 plus-minus 0.00006, and 0.9905 plus-minus 0.0025 on the 7K dataset. On the 100K dataset, they were 0.9904 plus-minus 0.0091, 0.9900 plus-minus 0.0071, 0.9900 plus-minus 0.0071, 0.9997 plus-minus 0.00016, and 0.9902 plus-minus 0.0006. Additionally, Grad-CAM visualizations were incorporated to enhance model interpretability by highlighting tissue areas that are medically relevant. These findings validate that MSRANetV2 is a reliable, interpretable, and high-performing architectural model for classifying CRC tissues.
翻译:结直肠癌(CRC)是全球癌症相关死亡的主要原因之一,及时精确检测在改善患者预后方面至关重要。传统的诊断方法如结肠镜检和组织学检查通常存在主观性、耗时极长且易受变异影响。随着数字病理学的发展,深度学习算法已成为提升诊断精度和效率的有力途径。本研究提出了一种名为MSRANetV2的卷积神经网络架构,专门针对结直肠组织图像分类进行优化。该模型以ResNet50V2为骨干网络,通过引入残差注意力机制和挤压激励(SE)模块,提取深层语义和细粒度空间特征。借助通道对齐和上采样操作,MSRANetV2有效融合了多尺度表征,从而增强了分类的鲁棒性。我们在两个公开数据集(CRC-VAL-HE-7K和NCT-CRC-HE-100K)上采用五折分层交叉验证策略评估模型性能。在7K数据集上,所提模型的平均精确率、召回率、F1分数、AUC和测试准确率分别为0.9884 ± 0.0151、0.9900 ± 0.0151、0.9900 ± 0.0145、0.9999 ± 0.00006和0.9905 ± 0.0025;在100K数据集上则分别为0.9904 ± 0.0091、0.9900 ± 0.0071、0.9900 ± 0.0071、0.9997 ± 0.00016和0.9902 ± 0.0006。此外,通过引入Grad-CAM可视化技术,模型可突出显示具有医学意义的组织区域,从而增强可解释性。这些结果验证了MSRANetV2是一种可靠、可解释且高性能的结直肠癌组织分类架构模型。