Reposted from: ArnetMiner
The network had been training for the last 12 hours. It all looked good: the gradients were flowing and the loss was decreasing. But then came the predictions: all zeroes, all background, nothing detected. “What did I do wrong?” — I asked my computer, who didn’t answer.
Where do you start checking when your model is outputting garbage (for example, predicting the mean of all outputs, or achieving really poor accuracy)?
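One quick symptom check worth running first (a numpy sketch of my own, not from the post's list): a model that has collapsed to predicting the mean, or to all background, shows near-zero variance across its outputs on a batch. The `preds` array here is a hypothetical stand-in for your model's outputs.

```python
import numpy as np

# Degeneracy check: if predictions barely vary across different inputs,
# the model has likely collapsed to a constant (e.g. the output mean).
# `preds` stands in for your model's outputs on one batch of inputs.
preds = np.array([0.501, 0.499, 0.500, 0.502, 0.498])

spread = preds.std()
collapsed = spread < 1e-2  # the threshold is a judgment call per task
print(f"output std: {spread:.4f} -> collapsed: {collapsed}")
```

If this fires, the problem is usually upstream of the data volume: labels, loss, or the final layer, which is what the list below helps you isolate.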
A network might not be training for a number of reasons. Over the course of many debugging sessions, I would often find myself doing the same checks. I've compiled my experience, along with the best ideas I've found elsewhere, into this handy list. I hope it will be of use to you, too.
0. How to use this guide?
I. Dataset issues
II. Data Normalization/Augmentation issues
III. Implementation issues
IV. Training issues
A lot of things can go wrong. But some of them are more likely to be broken than others. I usually start with this short list as an emergency first response:
Start with a simple model that is known to work for this type of data (for example, VGG for images). Use a standard loss if possible.
Turn off all bells and whistles, e.g. regularization and data augmentation.
If finetuning a model, double-check the preprocessing: it must match the original model's training preprocessing.
Verify that the input data is correct.
Start with a really small dataset (2–20 samples). Overfit on it and gradually add more data.
Gradually add back all the pieces that were omitted: augmentation/regularization, custom loss functions; then try more complex models.
If the steps above don’t do it, start going down the following big list and verify things one by one.
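The overfitting step above (2–20 samples) is the single most informative check: if the model cannot drive training error to zero on a handful of points, the bug is in the model or training loop, not in the data volume. A minimal numpy sketch of the idea, using toy synthetic data in place of a slice of your real dataset:

```python
import numpy as np

# Overfit-a-tiny-dataset sanity check: 4 linearly separable samples and
# a logistic-regression "model". Any working training loop should reach
# 100% training accuracy here within a few hundred steps.
rng = np.random.default_rng(0)
X = np.array([[0.0, 0.0], [0.0, 1.0], [1.0, 0.0], [1.0, 1.0]])
y = np.array([0, 0, 1, 1])  # label is determined by the first feature

w = rng.normal(size=2)
b = 0.0
lr = 1.0

for _ in range(500):
    p = 1.0 / (1.0 + np.exp(-(X @ w + b)))  # sigmoid predictions
    grad_logits = p - y                      # d(BCE loss)/d(logits)
    w -= lr * (X.T @ grad_logits) / len(y)
    b -= lr * grad_logits.mean()

train_acc = ((p > 0.5) == y).mean()
loss = -np.mean(y * np.log(p) + (1 - y) * np.log(1 - p))
print(f"train accuracy: {train_acc:.2f}, loss: {loss:.4f}")
```

With a real network the recipe is the same: slice off a few samples, train until the loss is near zero, then gradually add data back.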
Link (may require a VPN in mainland China):
https://blog.slavv.com/37-reasons-why-your-neural-network-is-not-working-4020854bd607
Original post:
https://m.weibo.cn/1870858943/4139636177186495