Repository navigation

#

layout-parser

Novalad offers a unified, centralized platform enabling organizations to extract meaningful data and perform advanced processing at high speed.

Jupyter Notebook
17
1 个月前

pdfDet aims to simplify PDF layout detect tasks for users.

Python
9
1 年前

Extracting structured text from GI Bill index cards for JDoc 2023 paper

Jupyter Notebook
2
2 年前

A lightweight Python library for metadata-rich document chunking in Retrieval-Augmented Generation (RAG) workflows. It leverages Azure AI Document Intelligence to enhance chunking by retaining hierarchical structure, page numbers, and bounding boxes for seamless integration with PDF viewers.

Python
2
7 个月前

Layout Parser notebook Implementation & Re-trained model for Image detection and extraction

Jupyter Notebook
1
1 年前