Repository navigation

#

tesseract

Tesseract Open Source OCR Engine (main repository)

C++
68913
4 天前
naptha/tesseract.js

Pure Javascript OCR for more than 100 Languages 📖🎉🖥

JavaScript
37038
13 天前
ocrmypdf/OCRmyPDF

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

Python
30849
1 天前
pymupdf/PyMuPDF

PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.

Python
7825
1 天前

Trained models with fast variant of the "best" LSTM models + legacy models

7093
1 年前
aisingapore/TagUI
JavaScript
6062
6 个月前
tebelorg/RPA-Python
Python
5313
6 个月前

A wrapper to work with Tesseract OCR inside PHP.

PHP
2994
5 个月前

Go package for OCR (Optical Character Recognition), by using Tesseract C++ library

Go
2938
5 个月前

Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections.

Python
2733
6 个月前

Document intelligence framework for Python - Extract text, metadata, and structured data from PDFs, images, Office documents, and more. Built on Pandoc, PDFium, and Tesseract.

Python
2262
4 天前

Experimental optical character recognition app

Java
2242
7 年前

A Python wrapper for the tesseract-ocr API

Python
2113
11 天前

Automation Utility - Recorder & Script Generator

AutoHotkey
1844
3 年前
Python
1774
8 个月前

Personal Assistant built using python libraries. It does almost anything which includes sending emails, Optical Text Recognition, Dynamic News Reporting at any time with API integration, Todo list generator, Opens any website with just a voice command, Plays Music, Wikipedia searching, Dictionary with Intelligent Sensing i.e. auto spell checking, Weather Reporting i.e. temp, wind speed, humidity, YouTube searching, Google Map searching, Youtube Downloading, etc.

Python
914
1 年前

Fork of tess-two rewritten from scratch to support latest version of Tesseract OCR.

C
866
21 天前

Ruby library for working with the Tesseract OCR.

Ruby
861
4 个月前