Repository navigation

document-image-processing

Website
Wikipedia

Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website to learn more about our enterprise grade Platform product for production grade workflows, partitioning, enrichments, chunking and embedding.

深度学习 document-parsing 机器学习自然语言处理 OCR information-retrieval data-pipelines preprocessing pdf-to-text pdf pdf-to-json document-image-analysis donut document-image-processing document-parser docx langchain 大语言模型

HTML

12818

1049

9 天前

Layout-Parser / layout-parser

A Unified Toolkit for Deep Learning Based Document Image Analysis

layout-analysis 深度学习 object-detection OCR layout-parser detectron2 document-layout-analysis 机器视觉 document-image-processing layout-detection

Python

5516

511

1 年前

fh2019ustc / Awesome-Document-Image-Rectification

A comprehensive list of awesome document image rectification papers.

document-image-processing 深度学习 Awesome Lists

477

2 个月前

fh2019ustc / DocTr

The official code for “DocTr: Document Image Transformer for Geometric Unwarping and Illumination Correction”, ACM MM, Oral Paper, 2021.

document-image-processing OCR pytorch-implementation

Python

401

4 个月前

fh2019ustc / DocScanner

The official repo for “DocScanner: Robust Document Image Rectification with Progressive Learning”, IJCV, 2025.

document-image-processing OCR

Python

261

4 个月前

GiftMungmeeprued / document-parsers-list

A comprehensive list of document parsers, covering PDF-to-text conversion and layout extraction. Each tested for support of tables, equations, handwriting, two-column layouts, and multi-column layouts.

data-pipeline document-image-processing document-parser document-parsing langchain OCR pdf pdf-to-text preprocessing

156

3 个月前

jiangnanboy / Doc-Image-Tool

文档图像处理工具(Document image processing tool)，包括漂白 / 文字方向矫正 / 清晰增强 / 笔记去噪美化 / 去阴影 / 扭曲矫正 / 切边增强(DocBleach / TextOrientationCorrection / DocSharpening / HandwritingDenoisingBeautifying / DocShadowRemoval / document_image_dewarping / DocTrimmingEnhancement)。

document-image-processing

Python

1 年前

fh2019ustc / DocGeoNet

The official code for “Geometric Representation Learning for Document Image Rectification”, ECCV, 2022.

document-image-processing OCR pytorch-implementation

Python

4 个月前

Nomiluks / Handwritting-OCR

Android App for English Handwritten Text Recognition

optical-character-recognition 神经网络 Android document-image-processing

Java

8 年前

caltechlibrary / documentarist

Process Caltech Archives' digital documents and photos, and annotate each page or image with information about its contents

机器学习 handwriting-recognition handwritten-text-recognition annotation tagging image-classification image-recognition document-classification document-image-processing

Python

3 年前

jchazalon / smartdoc15-ch1-pywrapper

Python wrapper to facilitate data manipulation for the SmartDoc 2015 - Challenge 1 Dataset.

datasets 机器视觉 document-image-processing

Jupyter Notebook

1 年前

Transkribus / competitions

The ScriptNet / competitions site.

competition Django document-image-processing benchmark-framework

Python

7 年前

tony-xlh / quality-evaluation-of-scanned-document-images

A web app evaluating the quality the scanned document images

document-image-processing image-quality-assessment

HTML

2 年前

jiangnanboy / docimg_tool

复杂背景图像漂白，文字方向矫正，清晰增强，笔记去噪美化，去阴影，扭曲矫正，去黑点以及切边增强。complex background image bleaching, text direction correction, clarity enhancement, note to blur beautification, shadow removal, distortion correction, black spots removal and cutting edge enhancement。

document-image-processing

1 年前

image-retrieval document-image-processing cbir

7 年前

sfikas / sophia-trikoupi-handwritten-dataset

Sophia Trikoupi dataset (Collection of 46 handwritten, annotated pages)

dataset document-image-processing

Python

6 年前

mx3123 / Py-document-cropper

This script automates the process of extracting text from various file formats (images, PDFs, DOCX) using Optical Character Recognition (OCR) powered by Azure Cognitive Services. The script supports image preprocessing, text extraction, and uploading of the processed files to Google Cloud Storage (GCP).

archive document-management google-cloud-storage MongoDB Python document-image-processing

Python

8 个月前