Repository navigation

#

document-understanding

RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs

TypeScript
65498
1 天前

mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding

Python
2249
4 个月前

Code for the paper "PICK: Processing Key Information Extraction from Documents using Improved Graph Learning-Convolutional Networks" (ICPR 2020)

Python
569
1 年前

Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding (ACL 2022)

Python
356
3 年前

Sample applications and demos for Document AI, the end-to-end document processing platform on Google Cloud

Jupyter Notebook
293
24 天前

A Curated List of Awesome Table Structure Recognition (TSR) Research. Including models, papers, datasets and codes. Continuously updating.

213
1 年前

Algorithms, papers, datasets, performance comparisons for Document AI. Continuously updating.

202
7 个月前

Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets.

Python
159
2 年前

DocGenome: An Open Large-scale Scientific Document Benchmark for Training and Testing Multi-modal Large Models

Jupyter Notebook
145
9 个月前
Jupyter Notebook
131
15 天前

ReadingBank: A Benchmark Dataset for Reading Order Detection

110
1 年前

Object Detection Model for Scanned Documents

Jupyter Notebook
94
7 个月前
Jupyter Notebook
88
7 个月前

Datasets and Evaluation Scripts for CompHRDoc

Python
50
7 个月前

[MM'2024] PEneo, an effective algorithm for key-value pair extraction from form-like documents, designed for real-world applications.

Python
37
6 个月前

TAT-DQA: Towards Complex Document Understanding By Discrete Reasoning

24
1 年前