Repository navigation

#

document-understanding

mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding

Python
2156
4 个月前

Code for the paper "PICK: Processing Key Information Extraction from Documents using Improved Graph Learning-Convolutional Networks" (ICPR 2020)

Python
563
9 个月前

Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding (ACL 2022)

Python
346
2 年前

Sample applications and demos for Document AI, the end-to-end document processing platform on Google Cloud

Jupyter Notebook
268
2 天前

Algorithms, papers, datasets, performance comparisons for Document AI. Continuously updating.

187
2 个月前

A Curated List of Awesome Table Structure Recognition (TSR) Research. Including models, papers, datasets and codes. Continuously updating.

175
7 个月前

Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets.

Python
157
1 年前

DocGenome: An Open Large-scale Scientific Document Benchmark for Training and Testing Multi-modal Large Models

Jupyter Notebook
132
3 个月前
Jupyter Notebook
120
2 年前

ReadingBank: A Benchmark Dataset for Reading Order Detection

104
8 个月前

Object Detection Model for Scanned Documents

Jupyter Notebook
91
1 个月前
Jupyter Notebook
65
1 个月前

Datasets and Evaluation Scripts for CompHRDoc

Python
37
2 个月前

[MM'2024] PEneo, an effective algorithm for key-value pair extraction from form-like documents, designed for real-world applications.

Python
32
12 天前

TAT-DQA: Towards Complex Document Understanding By Discrete Reasoning

23
7 个月前