Repository navigation
tesseract
- Website
- Wikipedia
Tesseract Open Source OCR Engine (main repository)
Pure Javascript OCR for more than 100 Languages 📖🎉🖥
PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
A wrapper to work with Tesseract OCR inside PHP.
Go package for OCR (Optical Character Recognition), by using Tesseract C++ library
Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections.
Experimental optical character recognition app
A Python wrapper for the tesseract-ocr API
Automation Utility - Recorder & Script Generator
Python tool for grabbing text via screenshot
Precompiled packages for AWS Lambda
Personal Assistant built using python libraries. It does almost anything which includes sending emails, Optical Text Recognition, Dynamic News Reporting at any time with API integration, Todo list generator, Opens any website with just a voice command, Plays Music, Wikipedia searching, Dictionary with Intelligent Sensing i.e. auto spell checking, Weather Reporting i.e. temp, wind speed, humidity, YouTube searching, Google Map searching, Youtube Downloading, etc.
Ruby library for working with the Tesseract OCR.
Fork of tess-two rewritten from scratch to support latest version of Tesseract OCR.
CCExtractor - Official version maintained by the core team