Repository navigation

#

document-ai

Python
21088
2 个月前

Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022

Python
6175
9 个月前

Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding (ACL 2022)

Python
346
2 年前

Algorithms, papers, datasets, performance comparisons for Document AI. Continuously updating.

188
2 个月前

ReadingBank: A Benchmark Dataset for Reading Order Detection

104
8 个月前

Official Implementation of Web-based Visual Corpus Builder (Webvicob), ICDAR 2023

Python
103
1 年前

SlideVQA: A Dataset for Document Visual Question Answering on Multiple Images (AAAI2023)

Python
88
19 天前

An unofficial PyTorch implementation of "Lin et al. ViBERTgrid: A Jointly Trained Multi-Modal 2D Document Representation for Key Information Extraction from Documents. ICDAR, 2021"

Python
53
1 年前

Table detection (TD) and table structure recognition (TSR) using Yolov5/Yolov8, and you can get the same (even better) result compared with Table Transformer (TATR) with smaller models.

Jupyter Notebook
45
10 个月前

Document AI Toolbox is an SDK for Python that provides utility functions for managing, manipulating, and extracting information from the document response. It creates a "wrapped" document object from JSON files in Cloud Storage, local JSON files, or output directly from the Document AI API.

Python
42
1 个月前

[MM'2024] PEneo, an effective algorithm for key-value pair extraction from form-like documents, designed for real-world applications.

Python
32
12 天前

Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.

27
2 年前

[MM'2024] Official release of RFUND introduced in the MM'2024 paper "PEneo: Unifying Line Extraction, Line Grouping, and Entity Linking for End-to-end Document Pair Extraction"

19
5 个月前

[Paper] Code for the EMNLP2023 (Findings) paper "Global Structure Knowledge-Guided Relation Extraction Method for Visually-Rich Document"

Python
17
1 年前

A Chatbot for the Document Analysis .

Python
11
1 年前

(WIP) ✨ A comprehensive resource for understanding the world of software used in the Document Understanding field. 🧙✨

Markdown
5
2 年前