layout-parser

PdfDet aims to simplify PDF layout detection tasks for users.

Python
9
1 year ago

Novalad offers a unified, centralized platform enabling organizations to extract meaningful data and perform advanced processing at high speed.

Jupyter Notebook
5
3 days ago

Extracting structured text from GI Bill index cards for JDoc 2023 paper

Jupyter Notebook
2
2 years ago

Layout Parser notebook implementation and retrained model for image detection and extraction

Jupyter Notebook
1
9 months ago

A lightweight Python library for metadata-rich document chunking in Retrieval-Augmented Generation (RAG) workflows. It leverages Azure AI Document Intelligence to enhance chunking by retaining hierarchical structure, page numbers, and bounding boxes for seamless integration with PDF viewers.

Python
0
3 months ago