The computer-aided diagnosis (CAD) system can provide a reference basis for the clinical diagnosis of skin diseases. Convolutional neural networks (CNNs) can not only extract visual elements such as colors and shapes but also semantic features. As such they have made great improvements in many tasks of dermoscopy images. The imaging of dermoscopy has no principal orientation, indicating that there are a large number of skin lesion rotations in the datasets. However, CNNs lack rotation invariance, which is bound to affect the robustness of CNNs against rotations. To tackle this issue, we propose a rotation meanout (RM) network to extract rotation-invariant features from dermoscopy images. In RM, each set of rotated feature maps corresponds to a set of outputs of the weight-sharing convolutions and they are fused using meanout strategy to obtain the final feature maps. Through theoretical derivation, the proposed RM network is rotation-equivariant and can extract rotation-invariant features when followed by the global average pooling (GAP) operation. The extracted rotation-invariant features can better represent the original data in classification and retrieval tasks for dermoscopy images. The RM is a general operation, which does not change the network structure or increase any parameter, and can be flexibly embedded in any part of CNNs. Extensive experiments are conducted on a dermoscopy image dataset. The results show our method outperforms other anti-rotation methods and achieves great improvements in dermoscopy image classification and retrieval tasks, indicating the potential of rotation invariance in the field of dermoscopy images.
翻译:计算机辅助诊断( CAD) 系统可以为皮肤疾病临床诊断提供参考依据。 革命神经网络( CNN) 不仅可以提取颜色和形状等视觉元素, 还可以提取语义特征。 因此, 它们极大地改进了许多脱温图像的任务。 脱温镜像的成像没有主要方向, 表明数据集中有大量的皮肤损伤旋转。 然而, CNN 网络缺乏旋转性变化, 这必然会影响CNN 相对于旋转的稳健性改进。 为了解决这个问题, 我们建议了一个旋转平均值( RM) 网络从脱温图像中提取旋转- 变异性特征。 在 RM 中, 每套旋转一套地图图图都与一套加权共享图象的输出相匹配。 通过理论推断, 拟议的 RM 网络网络是旋转性变异性, 并可在全球平均集( GAP) 操作中提取旋转性变异性变异性特征。 提取的RMDR 功能可以更好地显示常规图像的翻转动和变异性结构。 在外勤中, 提取的翻动性机和翻动性图像的运行中, 能够显示任何原始的翻动性操作, 将中, 或翻动的运行中, 或更动的运行中, 能够显示任何原的图像结构中, 任何原的变动的变动的变动式图图。