Repository navigation

#

pdf-conversion

borb-pdf/borb

borb is a library for reading, creating and manipulating PDF files in python.

Python
3505
1 个月前

Easily deployable and scalable backend server that efficiently converts various document formats (pdf, docx, pptx, html, images, etc) into Markdown. With support for both CPU and GPU processing, it is Ideal for large-scale workflows, it offers text/table extraction, OCR, and batch processing with sync/async endpoints.

Python
672
6 个月前

DocNET is as fast PDF editing and reading library for modern .NET applications

C#
546
1 年前

Run LibreOffice in AWS Lambda to create PDFs & convert documents

Dockerfile
523
2 年前

Handcrafted Go bindings for wkhtmltopdf and high-level HTML to PDF conversion interface

Go
278
8 天前

Python library to interact with https://pdftables.com API

Python
88
6 天前

PHP library for converting the version of PDF files (for compatibility purposes).

PHP
69
3 年前

Compilation of Turkish writings dataset that promotes creativity, content, composition, grammar, spelling and punctuation.

Python
51
3 个月前

does the magical pdf-to-pdf conversion popular in academic journal submission sites and does it faster than they do

Shell
30
6 年前

Convert HTML markup into beautiful PDF files using the famous wkhtmltopdf library.

PHP
19
7 年前

This repository contains some examples of using borb in google colab. These examples enable you to try out the features of borb without installing it on your system. They also ensure the system requirements and imports are all taken care of.

Jupyter Notebook
13
3 年前

Convert Thai constitution from PDF to plaintext and correct encoding glitches

HTML
10
8 年前

Convert Office file to PDF at scale

JavaScript
6
5 年前