The ResNet and its variants have achieved remarkable successes in various computer vision tasks. Despite its success in making gradient flow through building blocks, the simple shortcut connection mechanism limits the ability of re-exploring new potentially complementary features due to the additive function. To address this issue, in this paper, we propose to introduce a regulator module as a memory mechanism to extract complementary features, which are further fed to the ResNet. In particular, the regulator module is composed of convolutional RNNs (e.g., Convolutional LSTMs or Convolutional GRUs), which are shown to be good at extracting Spatio-temporal information. We named the new regulated networks as RegNet. The regulator module can be easily implemented and appended to any ResNet architecture. We also apply the regulator module for improving the Squeeze-and-Excitation ResNet to show the generalization ability of our method. Experimental results on three image classification datasets have demonstrated the promising performance of the proposed architecture compared with the standard ResNet, SE-ResNet, and other state-of-the-art architectures.
翻译:ResNet及其变体在各种计算机愿景任务中取得了显著成功。 尽管它成功地通过构件使梯度流成为梯度流,但简单的快捷连接机制限制了由于添加功能而重新探索潜在的补充功能的能力。为了解决这一问题,我们在本文件中提议采用一个调控模块作为提取补充功能的记忆机制,这些功能将进一步反馈给ResNet。特别是,调控模块由协同RNN(例如,Convolutional LSTMS 或 Convolutional GRUs)组成,这些功能在提取Spatio-时间信息方面表现良好。我们将新的调控网络命名为RegNet。调控模块可以很容易地实施并附在任何 ResNet 结构中。我们还将调控管模块用于改进Squeze-Excuring ResNet, 以显示我们方法的总体能力。三个图像分类数据集的实验结果显示,与标准ResNet、SE-ResNet和其他状态结构相比,拟议架构的绩效良好。