Among the machine learning approaches applied in computer vision, Convolutional Neural Network (CNN) is widely used in the field of image recognition. However, although existing CNN models have been proven to be efficient, it is not easy to find a network architecture with better performance. Some studies choose to optimize the network architecture, while others chose to optimize the hyperparameters, such as the number and size of convolutional kernels, convolutional strides, pooling size, etc. Most of them are designed manually, which requires relevant expertise and takes a lot of time. Therefore, this study proposes the idea of applying Simplified Swarm Optimization (SSO) on the hyperparameter optimization of LeNet models while using MNIST, Fashion MNIST, and Cifar10 as validation. The experimental results show that the proposed algorithm has higher accuracy than the original LeNet model, and it only takes a very short time to find a better hyperparameter configuration after training. In addition, we also analyze the output shape of the feature map after each layer, and surprisingly, the results were mostly rectangular. The contribution of the study is to provide users with a simpler way to get better results with the existing model., and this study can also be applied to other CNN architectures.
翻译:在计算机视觉中应用的机器学习方法中,进化神经网络(CNN)在图像识别领域被广泛使用,尽管已有CNN模型已被证明是有效的,但找到一个性能较好的网络结构并非易事。有些研究选择优化网络结构,而另一些研究选择优化超参数,如进化内核的数量和大小、进化性能、集合大小等。其中大多数是手工设计的,需要相关专门知识,需要大量时间。因此,本研究提出了在使用MNIST、Fashon MNIST和Cifar10作为验证的LeNet模型的超参数优化中应用简化的SSOS(SSO)的想法。实验结果表明,拟议的算法比原始的LeNet模型更精确,在培训后找到更好的超分度配置只需要很短的时间。此外,我们还分析了地貌图在每一层之后的输出形状,令人惊讶的是,结果大多是重新剖面的。这项研究的贡献是让用户能够更简单地使用其他的模型。