【专知荟萃19】图像识别Image Recognition知识资料全集（入门/进阶/论文/综述/视频/专家，附查看）

2017 年 11 月 18 日 专知专知内容组

点击上方“专知”关注获取专业AI知识!

【导读】主题荟萃知识是专知的核心功能之一，为用户提供AI领域系统性的知识学习服务。主题荟萃为用户提供全网关于该主题的精华（Awesome）知识资料收录整理，使得AI从业者便捷学习和解决工作问题！在专知人工智能主题知识树基础上，主题荟萃由专业人工编辑和算法工具辅助协作完成，并保持动态更新！另外欢迎对此创作主题荟萃感兴趣的同学，请加入我们专知AI创作者计划，共创共赢！今天专知为大家呈送第十八篇专知主题荟萃-图像识别知识资料大全集荟萃 （入门/进阶/综述/视频/代码/专家等），请大家查看！专知访问www.zhuanzhi.ai, 或关注微信公众号后台回复" 专知"进入专知，搜索主题“图像识别”查看。此外，我们也提供该文网页桌面手机端（www.zhuanzhi.ai）完整访问，可直接点击访问收录链接地址，以及pdf版下载链接，请文章末尾查看！此为初始版本，请大家指正补充，欢迎在后台留言！欢迎大家分享转发~

图像识别 Image Recognition 专知荟萃

入门学习
进阶文章
Imagenet result
2013
2014
2015
2016
2017
综述
Tutorial
视频教程
Datasets
代码
领域专家

入门学习

如何识别图像边缘？阮一峰

[http://www.ruanyifeng.com/blog/2016/07/edge-recognition.html]

CS231n课程笔记翻译：图像分类笔记

[https://zhuanlan.zhihu.com/p/20894041]

[http://cs231n.github.io/classification/]

深度学习、图像分类入门，从VGG16卷积神经网络开始 [http://blog.csdn.net/Errors_In_Life/article/details/65950699\]
The 9 Deep Learning Papers You Need To Know About (Understanding CNNs Part 3) 翻译

[http://blog.csdn.net/darkprince120/article/details/53024714]

深度学习框架Caffe图片分类教程

[http://blog.csdn.net/qq_31258245/article/details/75093380\]

MobileNet教程：用TensorFlow搭建在手机上运行的图像分类器

[https://zhuanlan.zhihu.com/p/28199892]

图像验证码和大规模图像识别技术

[http://www.infoq.com/cn/articles/CAPTCHA-image-recognition]

卷积神经网络如何进行图像识别

[http://www.infoq.com/cn/articles/convolutional-neural-networks-image-recognition]

图像识别与验证码

[https://zhuanlan.zhihu.com/securityCode]

图像识别(知乎话题) - [https://www.zhihu.com/topic/19588774/top-answers?page=1]

进阶文章

Imagenet result

Microsoft (Deep Residual Learning] [http://arxiv.org/pdf/1512.03385v1.pdfSlide](http://image-net.org/challenges/talks/ilsvrc2015_deep_residual_learning_kaiminghe.pdf]][[] Kaiming He, Xiangyu Zhang, Shaoqing Ren, Jian Sun, Deep Residual Learning for Image Recognition, arXiv:1512.03385.
Microsoft (PReLu/Weight Initialization] [http://arxiv.org/pdf/1502.01852] Kaiming He, Xiangyu Zhang, Shaoqing Ren, Jian Sun, Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification, arXiv:1502.01852.
Batch Normalization [http://arxiv.org/pdf/1502.03167] Sergey Ioffe, Christian Szegedy, Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift, arXiv:1502.03167.
GoogLeNet [http://arxiv.org/pdf/1409.4842] Christian Szegedy, Wei Liu, Yangqing Jia, Pierre Sermanet, Scott Reed, Dragomir Anguelov, Dumitru Erhan, Vincent Vanhoucke, Andrew Rabinovich, CVPR, 2015.
VGG-Net [http://www.robots.ox.ac.uk/~vgg/research/very_deep/] [http://arxiv.org/pdf/1409.1556] Karen Simonyan and Andrew Zisserman, Very Deep Convolutional Networks for Large-Scale Visual Recognition, ICLR, 2015.
AlexNet [http://papers.nips.cc/book/advances-in-neural-information-processing-systems-25-2012] Alex Krizhevsky, Ilya Sutskever, Geoffrey E. Hinton, ImageNet Classification with Deep Convolutional Neural Networks, NIPS, 2012.

2013

DeCAF: A Deep Convolutional Activation Feature for Generic Visual Recognition. Jeff Donahue, Yangqing Jia, Oriol Vinyals, Judy Hoffman, Ning Zhang, Eric Tzeng, Trevor Darrell

[http://arxiv.org/abs/1310.1531]

2014

CNN Features off-the-shelf: an Astounding Baseline for Recognition CVPR 2014

[http://arxiv.org/abs/1403.6382]

Deeply learned face representations are sparse, selective, and robust

[http://arxiv.org/abs/1412.1265]

Deep Learning Face Representation by Joint Identification-Verification
- [https://arxiv.org/abs/1406.4773]
Deep Learning Face Representation from Predicting 10,000 Classes. intro: CVPR 2014

[http://mmlab.ie.cuhk.edu.hk/pdf/YiSun_CVPR14.pdf]

Multiple Object Recognition with Visual Attention**

[https://arxiv.org/abs/1412.7755]

2015

HD-CNN: Hierarchical Deep Convolutional Neural Network for Image Classification intro: ICCV 2015

[https://arxiv.org/abs/1410.0736]

Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification. ImageNet top-5 error: 4.94%

[http://arxiv.org/abs/1502.01852]

Multi-attribute Learning for Pedestrian Attribute Recognition in Surveillance Scenarios

[http://ieeexplore.ieee.org/document/7486476/]

FaceNet: A Unified Embedding for Face Recognition and Clustering

[http://arxiv.org/abs/1503.03832]

2016

Humans and deep networks largely agree on which kinds of variation make object recognition harder**

[http://arxiv.org/abs/1604.06486]

FusionNet: 3D Object Classification Using Multiple Data Representations

[https://arxiv.org/abs/1607.05695]

Deep FisherNet for Object Classification**

[http://arxiv.org/abs/1608.00182]

Factorized Bilinear Models for Image Recognition**

[https://arxiv.org/abs/1611.05709]

Hyperspectral CNN Classification with Limited Training Samples**

[https://arxiv.org/abs/1611.09007]

The More You Know: Using Knowledge Graphs for Image Classification**

[https://arxiv.org/abs/1612.04844]

MaxMin Convolutional Neural Networks for Image Classification**

[http://webia.lip6.fr/~thomen/papers/Blot_ICIP_2016.pdf]

Cost-Effective Active Learning for Deep Image Classification. TCSVT 2016.

[https://arxiv.org/abs/1701.03551]

DeepFood: Deep Learning-Based Food Image Recognition for Computer-Aided Dietary Assessment

[http://arxiv.org/abs/1606.05675]

2017

Deep Collaborative Learning for Visual Recognition

[https://www.arxiv.org/abs/1703.01229]

Bilinear CNN Models for Fine-grained Visual Recognition

[http://vis-www.cs.umass.edu/bcnn/]

Multiple Instance Learning Convolutional Neural Networks for Object Recognition**

[https://arxiv.org/abs/1610.03155]

B-CNN: Branch Convolutional Neural Network for Hierarchical Classification

[https://arxiv.org/abs/1709.09890](

Why Do Deep Neural Networks Still Not Recognize These Images?: A Qualitative Analysis on Failure Cases of ImageNet Classification

[https://arxiv.org/abs/1709.03439]

Deep Mixture of Diverse Experts for Large-Scale Visual Recognition

[https://arxiv.org/abs/1706.07901]
Sunrise or Sunset: Selective Comparison Learning for Subtle Attribute Recognition
[https://arxiv.org/abs/1707.06335]

Convolutional Low-Resolution Fine-Grained Classification

[https://arxiv.org/abs/1703.05393]

综述

A Review of Image Recognition with Deep Convolutional Neural Network

[https://link.springer.com/chapter/10.1007/978-3-319-63309-1_7\]

Review on Image Recognition

[http://pnrsolution.org/Datacenter/Vol3/Issue2/186.pdf]

深度学习在图像识别中的研究进展与展望

[https://piazza-resources.s3.amazonaws.com/i48o74a0lqu0/i4fcg2o44k63n6/deep_recognition.pdf?AWSAccessKeyId=AKIAIEDNRLJ4AZKBW6HA&Expires=1509460321&Signature=DxZ8LrEEStKQrKESDufA7i3qIGA%3D\]

图像物体分类与检测算法综述黄凯奇任伟强谭铁牛 [http://cjc.ict.ac.cn/online/cre/hkq-2014526115913.pdf]
Book Chapter - Objecter Recognition

[http://www.cse.usf.edu/~r1k/MachineVisionBook/MachineVision.files/MachineVision_Chapter15.pdf\]

Tutorial

CVPR tutorial : Large-Scale Visual Recognition

[http://www.europe.naverlabs.com/Research/Computer-Vision/Highlights/CVPR-tutorial-Large-Scale-Visual-Recognition]

Image Recognition with Tensorflow

[https://www.tensorflow.org/tutorials/image_recognition\]

Visual Object Recognition Tutorial by Bastian Leibe & Kristen Grauman

[https://www.google.com.au/url?sa=t&rct=j&q=&esrc=s&source=web&cd=32&cad=rja&uact=8&ved=0ahUKEwiWrq3W5JrXAhWFLpQKHQPuCcI4HhAWCC8wAQ&url=http%3A%2F%2Fz.cs.utexas.edu%2Fusers%2Fpiyushk%2Fcourses%2Fspr12%2Fslides%2FAAAI-tutorial-2.ppt&usg=AOvVaw3tQkyK0zW7nZ28LhrGzCUC]

视频教程

CS231n: Convolutional Neural Networks for Visual Recognition

[http://cs231n.stanford.edu/]

李飞飞: 我们怎么教计算机理解图片？
- [https://www.youtube.com/watch?v=40riCqvRoMs]

Datasets

MNIST: handwritten digits (http://yann.lecun.com/exdb/mnist/)
NIST: similar to MNIST, but larger
Perturbed NIST: a dataset developed in Yoshua’s class (NIST with tons of deformations)
CIFAR10 / CIFAR100: 32×32 natural image dataset with 10/100 categories ( http://www.cs.utoronto.ca/~kriz/cifar.html)
Caltech 101: pictures of objects belonging to 101 categories (http://www.vision.caltech.edu/Image_Datasets/Caltech101/)
Caltech 256: pictures of objects belonging to 256 categories (http://www.vision.caltech.edu/Image_Datasets/Caltech256/)
Caltech Silhouettes: 28×28 binary images contains silhouettes of the Caltech 101 dataset
STL-10 dataset is an image recognition dataset for developing unsupervised feature learning, deep learning, self-taught learning algorithms. It is inspired by the CIFAR-10 dataset but with some modifications. http://www.stanford.edu/~acoates//stl10/
The Street View House Numbers (SVHN) Dataset – http://ufldl.stanford.edu/housenumbers/
NORB: binocular images of toy figurines under various illumination and pose (http://www.cs.nyu.edu/~ylclab/data/norb-v1.0/)
Imagenet: image database organized according to the WordNethierarchy (http://www.image-net.org/)
Pascal VOC: various object recognition challenges (http://pascallin.ecs.soton.ac.uk/challenges/VOC/)
Labelme: A large dataset of annotated images, http://labelme.csail.mit.edu/Release3.0/browserTools/php/dataset.php
COIL 20: different objects imaged at every angle in a 360 rotation(http://www.cs.columbia.edu/CAVE/software/softlib/coil-20.php)
COIL100: different objects imaged at every angle in a 360 rotation (http://www1.cs.columbia.edu/CAVE/software/softlib/coil-100.php)