We present a simple but powerful architecture of convolutional neural network, which has a VGG-like inference-time body composed of nothing but a stack of 3x3 convolution and ReLU, while the training-time model has a multi-branch topology. Such decoupling of the training-time and inference-time architecture is realized by a structural re-parameterization technique so that the model is named RepVGG. On ImageNet, RepVGG reaches over 80\% top-1 accuracy, which is the first time for a plain model, to the best of our knowledge. On NVIDIA 1080Ti GPU, RepVGG models run 83% faster than ResNet-50 or 101% faster than ResNet-101 with higher accuracy and show favorable accuracy-speed trade-off compared to the state-of-the-art models like EfficientNet and RegNet. The code and trained models are available at https://github.com/megvii-model/RepVGG.
翻译:我们展示了一个简单而强大的革命神经网络结构,这个结构拥有一个类似VGG的推论时间体,由3x3的卷叠和ReLU组成,而培训时间模型则具有多部门地形学。这种培训时间和推论时间结构的脱钩是通过结构再校准技术实现的,因此该模型的名称是RepVGG。在图像网上,RepVGG达到80%以上最高至1级的精确度,这是我们所了解的最简单模型的第一次。关于NVIDIA 1080Ti GPU, RepVGG模型比ResNet-50或ResNet-101更快83%,精确度更高,显示与节能网络和RegNet等最新模型相比的精确速度交易速度。该代码和经过培训的模型可在https://github.com/megvivime-mod/RepVGG中查阅。