ocr-python

hiroi-sora/Umi-OCR

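Free, open-source, offline OCR software. Supports screenshot capture and batch image import, PDF document recognition, exclusion of watermarks and headers/footers, and scanning and generating QR codes. Includes built-in language libraries for multiple languages.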

Python
36182
3 months ago

Document (PDF, Word, PPTX ...) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents. Remove PII. Convert any document or picture to structured JSON or Markdown

Python
2792
18 days ago

An ending and a new beginning

QML
945
2 years ago

Lightweight & fast OCR models for license plate text recognition.

Python
212
18 days ago

A carefully designed OCR pipeline for universal bordered table recognition and reconstruction.

C++
178
3 years ago

Perform text detection in a variety of languages with your computer webcam using Google Tesseract OCR and OpenCV. This script achieves a real-time OCR effect via multi-threading.

Python
171
3 years ago

Manga OCR snipping application for desktop

Python
130
3 years ago

Anansi is a computer vision (cv2 and FFmpeg) + OCR (EasyOCR and tesseract) python-based crawler for finding and extracting questions and correct answers from video files of popular TV game shows in the Balkan region.

Python
125
3 years ago

Collection of PDF parsing libraries like AI based docling, claude, openai, gemini, meta's llama-vision, unstructured-io, and pdfminer, pymupdf, pdfplumber etc for efficient snapshot, text, table, and metadata extraction.

Python
123
12 days ago

Python 3 package for Chinese/English OCR that uses the paddleocr-v5 ONNX model (~20 MB), with ultra-fast inference speed. Inference is based on the ppocr-v5-onnx model, described as open-source SOTA for Chinese/English OCR.

Python
98
1 month ago

FLOSS software for Persian Optical Character Recognition

Jupyter Notebook
90
1 year ago

PDF text data extraction web app with OCR for scanned documents

Python
88
1 year ago

Easter2.0: IMPROVING CONVOLUTIONAL MODELS FOR HANDWRITTEN TEXT RECOGNITION

Jupyter Notebook
79
2 years ago

Turn any OCR model into an online inference API endpoint 🚀 🌖

Python
56
5 months ago

Official Python client library for Nutrient Document Web Services API - PDF processing, OCR, watermarking, and document manipulation with automatic Office format conversion

Python
54
21 hours ago