OCR文字,车牌,验证码识别 专知荟萃
入门学习
- 端到端的OCR:基于CNN的实现
- 如何用卷积神经网络CNN识别手写数字集?
- OCR文字识别用的是什么算法?
- 基于计算机视觉/深度学习打造先进OCR工作流
Creating a Modern OCR Pipeline Using Computer Vision and Deep Learning
- 车牌识别中的不分割字符的端到端(End-to-End)识别
- 端到端的OCR:基于CNN的实现
- 腾讯OCR—自动识别技术,探寻文字真实的容颜
- Tesseract-OCR引擎 入门
- 汽车挡风玻璃VIN码识别
- 车牌识别算法的关键技术及其研究现状
- 端到端的OCR:验证码识别
论文及代码
文字识别
- Multi-digit Number Recognition from Street View Imagery using Deep Convolutional Neural Networks
- End-to-End Text Recognition with Convolutional Neural Networks
- Word Spotting and Recognition with Embedded Attributes
- Reading Text in the Wild with Convolutional Neural Networks
- Deep structured output learning for unconstrained text recognition
- Deep Features for Text Spotting
- Reading Scene Text in Deep Convolutional Sequences
- DeepFont: Identify Your Font from An Image
- An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition
- Recursive Recurrent Nets with Attention Modeling for OCR in the Wild
- Writer-independent Feature Learning for Offline Signature Verification using Deep Convolutional Neural Networks
- DeepText: A Unified Framework for Text Proposal Generation and Text Detection in Natural Images
- End-to-End Interpretation of the French Street Name Signs Dataset
- End-to-End Subtitle Detection and Recognition for Videos in East Asian Languages via CNN Ensemble with Near-Human-Level Performance
- Smart Library: Identifying Books in a Library using Richly Supervised Deep Scene Text Reading
- Improving Text Proposals for Scene Images with Fully Convolutional Networks
- intro: Universitat Autonoma de Barcelona & University of Florence
- intro: International Conference on Pattern Recognition - DLPR workshop
- arxiv: [https://arxiv.org/abs/1702.05089]
- Scene Text Eraser
- Attention-based Extraction of Structured Information from Street View Imagery
- STN-OCR: A single Neural Network for Text Detection and Text Recognition
- Sequence to sequence learning for unconstrained scene text recognition
- Drawing and Recognizing Chinese Characters with Recurrent Neural Network
- Learning Spatial-Semantic Context with Fully Convolutional Recurrent Network for Online Handwritten Chinese Text Recognition
- Stroke Sequence-Dependent Deep Convolutional Neural Network for Online Handwritten Chinese Character Recognition
- Visual attention models for scene text recognition
- Focusing Attention: Towards Accurate Text Recognition in Natural Images
- Scene Text Recognition with Sliding Convolutional Character Models
- AdaDNNs: Adaptive Ensemble of Deep Neural Networks for Scene Text Recognition
- A New Hybrid-parameter Recurrent Neural Networks for Online Handwritten Chinese Character Recognition
- Arbitrarily-Oriented Text Recognition
文字检测
-
Object Proposals for Text Extraction in the Wild
-
Text-Attentional Convolutional Neural Networks for Scene Text Detection
-
Accurate Text Localization in Natural Image with Cascaded Convolutional Text Network
-
Synthetic Data for Text Localisation in Natural Images
-
Scene Text Detection via Holistic, Multi-Channel Prediction
-
Detecting Text in Natural Image with Connectionist Text Proposal Network
-
TextBoxes: A Fast Text Detector with a Single Deep Neural Network
-
Deep Matching Prior Network: Toward Tighter Multi-oriented Text Detection
-
Detecting Oriented Text in Natural Images by Linking Segments
-
Deep Direct Regression for Multi-Oriented Scene Text Detection
-
Cascaded Segmentation-Detection Networks for Word-Level Text Spotting
-
WordFence: Text Detection in Natural Images with Border Awareness
-
SSD-text detection: Text Detector
-
R2CNN: Rotational Region CNN for Orientation Robust Scene Text Detection
-
R-PHOC: Segmentation-Free Word Spotting using CNN
-
Towards End-to-end Text Spotting with Convolutional Recurrent Neural Networks
-
EAST: An Efficient and Accurate Scene Text Detector
-
Deep Scene Text Detection with Connected Component Proposals
-
Single Shot Text Detector with Regional Attention
-
Fused Text Segmentation Networks for Multi-oriented Scene Text Detection
-
Deep Residual Text Detection Network for Scene Text
- intro: IAPR International Conference on Document Analysis and Recognition 2017. Samsung R&D Institute of China, Beijing
- arxiv: [https://arxiv.org/abs/1711.04147]
-
Feature Enhancement Network: A Refined Scene Text Detector
-
ArbiText: Arbitrary-Oriented Text Detection in Unconstrained Scene
验证码破解
- Using deep learning to break a Captcha system
- Breaking reddit captcha with 96% accuracy
- I’m not a human: Breaking the Google reCAPTCHA
- Neural Net CAPTCHA Cracker
- Recurrent neural networks for decoding CAPTCHAS
- Reading irctc captchas with 95% accuracy using deep learning
- I Am Robot: Learning to Break Semantic Image CAPTCHAs
- SimGAN-Captcha
手写体识别
- High Performance Offline Handwritten Chinese Character Recognition Using GoogLeNet and Directional Feature Maps
- Recognize your handwritten numbers
- Handwritten Digit Recognition using Convolutional Neural Networks in Python with Keras
- MNIST Handwritten Digit Classifier
- LeNet – Convolutional Neural Network in Python
- Scan, Attend and Read: End-to-End Handwritten Paragraph Recognition with MDLSTM Attention
- MLPaint: the Real-Time Handwritten Digit Recognizer
- Training a Computer to Recognize Your Handwriting
- Using TensorFlow to create your own handwriting recognition engine
- Building a Deep Handwritten Digits Classifier using Microsoft Cognitive Toolkit
- Hand Writing Recognition Using Convolutional Neural Networks
- Design of a Very Compact CNN Classifier for Online Handwritten Chinese Character Recognition Using DropWeight and Global Pooling
- Handwritten digit string recognition by combination of residual network and RNN-CTC
车牌识别
- Reading Car License Plates Using Deep Convolutional Neural Networks and LSTMs
- Number plate recognition with Tensorflow
- end-to-end-for-plate-recognition
- Segmentation-free Vehicle License Plate Recognition using ConvNet-RNN
- intro: International Workshop on Advanced Image Technology, January, 8-10, 2017. Penang, Malaysia. Proceeding IWAIT2017
- arxiv: [https://arxiv.org/abs/1701.06439]
- License Plate Detection and Recognition Using Deeply Learned Convolutional Neural Networks
- Adversarial Generation of Training Examples for Vehicle License Plate Recognition
- Towards End-to-End Car License Plates Detection and Recognition with Deep Neural Networks
实战项目
- 多标签分类,端到端基于mxnet的中文车牌识别
- 中国二代身份证光学识别
- EasyPR 一个开源的中文车牌识别系统
- 汽车挡风玻璃VIN码识别
- CLSTM : A small C++ implementation of LSTM networks, focused on OCR
- OCR text recognition using tensorflow with attention
- Digit Recognition via CNN: digital meter numbers detection
- Attention-OCR: Visual Attention based OCR
- umaru: An OCR-system based on torch using the technique of LSTM/GRU-RNN, CTC and referred to the works of rnnlib and clstm
- Tesseract.js: Pure Javascript OCR for 62 Languages
- DeepHCCR: Offline Handwritten Chinese Character Recognition based on GoogLeNet and AlexNet
- deep ocr: make a better chinese character recognition OCR than tesseract
- Practical Deep OCR for scene text using CTPN + CRNN
- Text-Detection-using-py-faster-rcnn-framework
- ocropy: Python-based tools for document analysis and OCR
- Extracting text from an image using Ocropus
视频
- LSTMs for OCR
初步版本,水平有限,有错误或者不完善的地方,欢迎大家提建议和补充,会一直保持更新,本文为专知内容组原创内容,未经允许不得转载,如需转载请发送邮件至fangquanyi@gmail.com 或 联系微信专知小助手(Rancho_Fang)
敬请关注http://www.zhuanzhi.ai 和关注专知公众号,获取第一手AI相关知识