OCR（光学字符识别）论文 - 专知

会员服务 ·

OCR（光学字符识别）

OCR（光学字符识别）

OCR（光学字符识别）

DKDS: A Benchmark Dataset of Degraded Kuzushiji Documents with Seals for Detection and Binarization

Arxiv

0+阅读 · 12月18日

CoT4Det: A Chain-of-Thought Framework for Perception-Oriented Vision-Language Tasks

Arxiv

0+阅读 · 12月7日

Uni-MuMER: Unified Multi-Task Fine-Tuning of Vision-Language Model for Handwritten Mathematical Expression Recognition

Arxiv

0+阅读 · 10月22日

KrishokBondhu: A Retrieval-Augmented Voice-Based Agricultural Advisory Call Center for Bengali Farmers

Arxiv

0+阅读 · 10月21日

CharDiff: A Diffusion Model with Character-Level Guidance for License Plate Image Restoration

Arxiv

0+阅读 · 10月20日

Detecting Legend Items on Historical Maps Using GPT-4o with In-Context Learning

Detecting Legend Items on Historical Maps Using GPT-4o with In-Context Learning

Arxiv

0+阅读 · 10月9日

A Lightweight Multi-Module Fusion Approach for Korean Character Recognition

Arxiv

0+阅读 · 4月8日

Task-driven single-image super-resolution reconstruction of document scans

Arxiv

0+阅读 · 3月18日

Judge a Book by its Cover: Investigating Multi-Modal LLMs for Multi-Page Handwritten Document Transcription

Arxiv

0+阅读 · 2月27日

ChemScraper: Leveraging PDF Graphics Instructions for Molecular Diagram Parsing

Arxiv

0+阅读 · 2月26日

Éclair -- Extracting Content and Layout with Integrated Reading Order for Documents

Arxiv

0+阅读 · 2月6日

Reading the unreadable: Creating a dataset of 19th century English newspapers using image-to-text language models

Arxiv

0+阅读 · 2月18日

Efficient Video-Based ALPR System Using YOLO and Visual Rhythm

Arxiv

0+阅读 · 1月8日

Geometry Restoration and Dewarping of Camera-Captured Document Images

Arxiv

0+阅读 · 1月6日

Geometry Restoration and Dewarping of Camera-Captured Document Images

Arxiv

0+阅读 · 1月9日

参考链接

微信扫码咨询专知VIP会员