For the past ten years, CNNs have reigned supreme in the world of computer vision, but recently, the Transformer has been on the rise. However, the quadratic computational cost of self-attention has become a serious problem in practice. In this context, there has been much research on architectures that use neither CNNs nor self-attention. In particular, MLP-Mixer is a simple architecture designed using MLPs that achieves accuracy comparable to the Vision Transformer. However, the only inductive bias in this architecture is the embedding of tokens. This leaves open the possibility of building a non-convolutional inductive bias into the architecture itself, and we do so with two simple ideas. One is to divide the token-mixing block vertically and horizontally. The other is to make spatial correlations denser among some channels of token-mixing. With this approach, we were able to improve the accuracy of MLP-Mixer while reducing its parameters and computational complexity. Compared to other MLP-based models, the proposed model, named RaftMLP, has a good balance of computational complexity, the number of parameters, and actual memory usage. In addition, our work indicates that MLP-based models have the potential to replace CNNs by adopting inductive bias. The source code in PyTorch version is available at \url{https://github.com/okojoalg/raft-mlp}.
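To make the first idea concrete, below is a minimal PyTorch sketch of splitting token mixing into separate vertical and horizontal passes over an h x w token grid. The module name `VerticalHorizontalMixing`, the expansion factor, and the normalization and residual placement are illustrative assumptions, not the authors' implementation; see the linked repository for the actual RaftMLP code.

```python
import torch
import torch.nn as nn


class VerticalHorizontalMixing(nn.Module):
    """Minimal sketch of vertical/horizontal token mixing on an h x w token grid.

    Instead of one MLP over all h*w tokens (as in MLP-Mixer), tokens are mixed
    along columns (vertical pass) and then rows (horizontal pass), building a
    weak spatial inductive bias into the block. Hypothetical module, not the
    RaftMLP implementation.
    """

    def __init__(self, h: int, w: int, dim: int, expansion: int = 2):
        super().__init__()
        self.h, self.w = h, w
        self.norm_v = nn.LayerNorm(dim)
        self.mix_v = nn.Sequential(
            nn.Linear(h, h * expansion), nn.GELU(), nn.Linear(h * expansion, h)
        )
        self.norm_h = nn.LayerNorm(dim)
        self.mix_h = nn.Sequential(
            nn.Linear(w, w * expansion), nn.GELU(), nn.Linear(w * expansion, w)
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, h*w, dim) token sequence laid out on an h x w grid
        b, n, d = x.shape
        # vertical pass: apply an MLP along the column (height) axis
        y = self.norm_v(x).reshape(b, self.h, self.w, d)
        y = self.mix_v(y.permute(0, 2, 3, 1)).permute(0, 3, 1, 2)  # mix over h
        x = x + y.reshape(b, n, d)
        # horizontal pass: apply an MLP along the row (width) axis
        y = self.norm_h(x).reshape(b, self.h, self.w, d)
        y = self.mix_h(y.permute(0, 1, 3, 2)).permute(0, 1, 3, 2)  # mix over w
        x = x + y.reshape(b, n, d)
        return x


# Usage: for a 14 x 14 grid of 128-dimensional tokens, the block maps a
# (batch, 196, 128) tensor to the same shape.
x = torch.randn(2, 14 * 14, 128)
block = VerticalHorizontalMixing(h=14, w=14, dim=128)
print(block(x).shape)  # torch.Size([2, 196, 128])
```

Because each pass mixes only h or w tokens rather than all h*w at once, the token-mixing weights shrink from O((hw)^2) to O(h^2 + w^2) parameters, which is one way such a split can reduce parameters and computation relative to MLP-Mixer.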