Repository navigation

#

document-understanding

mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding

Python
2238
3 个月前

Code for the paper "PICK: Processing Key Information Extraction from Documents using Improved Graph Learning-Convolutional Networks" (ICPR 2020)

Python
567
1 年前

Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding (ACL 2022)

Python
351
3 年前

Sample applications and demos for Document AI, the end-to-end document processing platform on Google Cloud

Jupyter Notebook
286
6 天前

A Curated List of Awesome Table Structure Recognition (TSR) Research. Including models, papers, datasets and codes. Continuously updating.

203
1 年前

Algorithms, papers, datasets, performance comparisons for Document AI. Continuously updating.

198
6 个月前

Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets.

Python
158
1 年前

DocGenome: An Open Large-scale Scientific Document Benchmark for Training and Testing Multi-modal Large Models

Jupyter Notebook
142
7 个月前
Jupyter Notebook
130
2 年前

ReadingBank: A Benchmark Dataset for Reading Order Detection

108
1 年前

Object Detection Model for Scanned Documents

Jupyter Notebook
94
5 个月前
Jupyter Notebook
80
5 个月前

Datasets and Evaluation Scripts for CompHRDoc

Python
49
6 个月前

[MM'2024] PEneo, an effective algorithm for key-value pair extraction from form-like documents, designed for real-world applications.

Python
36
4 个月前

TAT-DQA: Towards Complex Document Understanding By Discrete Reasoning

23
1 年前