document-ai

Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022

Python
6491
1 year ago

Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding (ACL 2022)

Python
351
3 years ago

Algorithms, papers, datasets, performance comparisons for Document AI. Continuously updating.

198
6 months ago

Official Implementation of Web-based Visual Corpus Builder (Webvicob), ICDAR 2023

Python
108
2 years ago

ReadingBank: A Benchmark Dataset for Reading Order Detection

108
1 year ago

SlideVQA: A Dataset for Document Visual Question Answering on Multiple Images (AAAI2023)

Python
92
5 months ago

An unofficial PyTorch implementation of "Lin et al. ViBERTgrid: A Jointly Trained Multi-Modal 2D Document Representation for Key Information Extraction from Documents. ICDAR, 2021"

Python
53
2 years ago

Table detection (TD) and table structure recognition (TSR) using YOLOv5/YOLOv8, which can match or even exceed the results of Table Transformer (TATR) with smaller models.

Jupyter Notebook
49
1 year ago

Document AI Toolbox is an SDK for Python that provides utility functions for managing, manipulating, and extracting information from the document response. It creates a "wrapped" document object from JSON files in Cloud Storage, local JSON files, or output directly from the Document AI API.

Python
46
5 months ago

[CVPR2025] VDocRAG: Retrieval-Augmented Generation over Visually-Rich Documents

Python
37
3 months ago

[MM'2024] PEneo, an effective algorithm for key-value pair extraction from form-like documents, designed for real-world applications.

Python
36
4 months ago

Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.

29
2 years ago

[MM'2024] Official release of RFUND introduced in the MM'2024 paper "PEneo: Unifying Line Extraction, Line Grouping, and Entity Linking for End-to-end Document Pair Extraction"

20
9 months ago

[Paper] Code for the EMNLP2023 (Findings) paper "Global Structure Knowledge-Guided Relation Extraction Method for Visually-Rich Document"

Python
17
2 years ago

This repository contains the implementation of DANIEL (a fast Document Attention Network for Information Extraction and Labeling of handwritten documents).

Python
16
1 month ago