Repository navigation

#

tesseract

Tesseract Open Source OCR Engine (main repository)

C++
66255
23 天前
naptha/tesseract.js

Pure Javascript OCR for more than 100 Languages 📖🎉🖥

JavaScript
36401
12 天前
ocrmypdf/OCRmyPDF

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

Python
27638
13 天前
pymupdf/PyMuPDF

PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.

Python
6955
2 天前

Trained models with fast variant of the "best" LSTM models + legacy models

6856
1 年前
aisingapore/TagUI
JavaScript
5904
2 个月前
tebelorg/RPA-Python
Python
5174
2 个月前

A wrapper to work with Tesseract OCR inside PHP.

PHP
2954
25 天前

Go package for OCR (Optical Character Recognition), by using Tesseract C++ library

Go
2842
25 天前

Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections.

Python
2616
2 个月前

Experimental optical character recognition app

Java
2233
7 年前

A Python wrapper for the tesseract-ocr API

Python
2085
2 个月前

Automation Utility - Recorder & Script Generator

AutoHotkey
1778
3 年前
Python
1761
4 个月前

Personal Assistant built using python libraries. It does almost anything which includes sending emails, Optical Text Recognition, Dynamic News Reporting at any time with API integration, Todo list generator, Opens any website with just a voice command, Plays Music, Wikipedia searching, Dictionary with Intelligent Sensing i.e. auto spell checking, Weather Reporting i.e. temp, wind speed, humidity, YouTube searching, Google Map searching, Youtube Downloading, etc.

Python
887
1 年前

Ruby library for working with the Tesseract OCR.

Ruby
854
2 年前

Fork of tess-two rewritten from scratch to support latest version of Tesseract OCR.

C
822
4 个月前