With the increasing popularity of deep learning, Convolutional Neural Networks (CNNs) have been widely applied in various domains, such as image classification and object detection, and achieve stunning success in terms of their high accuracy over the traditional statistical methods. To exploit the potential of CNN models, a huge amount of research and industry efforts have been devoted to optimizing CNNs. Among these endeavors, CNN architecture design has attracted tremendous attention because of its great potential of improving model accuracy or reducing model complexity. However, existing work either introduces repeated training overhead in the search process or lacks an interpretable metric to guide the design. To clear these hurdles, we propose 3D-Receptive Field (3DRF), an explainable and easy-to-compute metric, to estimate the quality of a CNN architecture and guide the search process of designs. To validate the effectiveness of 3DRF, we build a static optimizer to improve the CNN architectures at both the stage level and the kernel level. Our optimizer not only provides a clear and reproducible procedure but also mitigates unnecessary training efforts in the architecture search process. Extensive experiments and studies show that the models generated by our optimizer can achieve up to 5.47% accuracy improvement and up to 65.38% parameters deduction, compared with state-of-the-art CNN structures like MobileNet and ResNet.
翻译:随着深层学习的日益普及,革命神经网络(CNNs)被广泛应用于各个领域,例如图像分类和物体探测等领域,在传统统计方法的高度精准性方面取得了惊人的成功。为了利用CNN模型的潜力,已经投入了大量研究和工业努力来优化CNN。在这些努力中,CNN的建筑设计吸引了极大的关注,因为它具有提高模型准确性或降低模型复杂性的巨大潜力。然而,现有的工作要么在搜索过程中反复引入了培训间接费用,要么在设计过程中缺乏可解释的指标来指导设计。为了清除这些障碍,我们建议3D受控字段(3DRF),这是一个可解释的、易于比较的计量标准,以估计CNN架构的质量并指导设计过程的搜索过程。为了验证3DRF的效能,我们建立了一个静态优化器,以改进CNNC架构的阶段和内核层面。我们的优化器不仅提供了清晰和可复制的程序,而且还减少了建筑搜索过程中不必要的培训努力。我们提议了3D-受控域域域域域(3DRRF),一个可解释的、可解释和易于理解的计量的计量的计量的计量的计量的计量标准,以便估计CNNCNCRM-47的模型的精确度结构升级的精确度结构结构可以达到5-47的精确度。我们生成的模型和精确度。