Computer assisted diagnosis in digital pathology is becoming ubiquitous as it can provide more efficient and objective healthcare diagnostics. Recent advances have shown that the convolutional Neural Network (CNN) architectures, a well-established deep learning paradigm, can be used to design a Computer Aided Diagnostic (CAD) System for breast cancer detection. However, the challenges due to stain variability and the effect of stain normalization with such deep learning frameworks are yet to be well explored. Moreover, performance analysis with arguably more efficient network models, which may be important for high throughput screening, is also not well explored.To address this challenge, we consider some contemporary CNN models for binary classification of breast histopathology images that involves (1) the data preprocessing with stain normalized images using an adaptive colour deconvolution (ACD) based color normalization algorithm to handle the stain variabilities; and (2) applying transfer learning based training of some arguably more efficient CNN models, namely Visual Geometry Group Network (VGG16), MobileNet and EfficientNet. We have validated the trained CNN networks on a publicly available BreaKHis dataset, for 200x and 400x magnified histopathology images. The experimental analysis shows that pretrained networks in most cases yield better quality results on data augmented breast histopathology images with stain normalization, than the case without stain normalization. Further, we evaluated the performance and efficiency of popular lightweight networks using stain normalized images and found that EfficientNet outperforms VGG16 and MobileNet in terms of test accuracy and F1 Score. We observed that efficiency in terms of test time is better in EfficientNet than other networks; VGG Net, MobileNet, without much drop in the classification accuracy.
翻译:在数字病理学方面,计算机协助诊断的数字病理学正在变得无处不在,因为它可以提供更有效率和客观的保健诊断。最近的进展表明,进化神经网络结构(CNN)结构,这种结构是根深蒂固的深层学习范式,可以用来设计计算机辅助诊断系统,用于乳腺癌检测;然而,由于变异性以及这种深层次学习框架的污点正常化的影响,尚未充分探讨。此外,利用效率更高的网络模型进行绩效分析,这或许对通过量的精度筛选很重要。为了应对这一挑战,我们考虑一些现代CNN模型,用于乳腺病理学图像的二元性分类,其中包括(1) 使用适应性色变异性变异性(CAD)的变异性化图像,以染色变异性(CACD)的变本化图像进行数据预处理;(2) 应用效率更高的CNN模型的转移学习方法,即视觉测地小组网络(VGGI16)、移动网络和高效的网络。我们验证了经过训练的CNN网络网络在公开提供的BreKs Stal Stal Stal Stality 数据中,在200xeral Stality网络中用最精确的精化的精化的精化的精化的精化数据分析中,在不进行更精确性分析。