Repository navigation

#

pdf-conversion

jorisschellekens/borb

borb is a library for reading, creating and manipulating PDF files in python.

Python
3468
5 个月前

Run LibreOffice in AWS Lambda to create PDFs & convert documents

Dockerfile
521
2 年前

Easily deployable and scalable backend server that efficiently converts various document formats (pdf, docx, pptx, html, images, etc) into Markdown. With support for both CPU and GPU processing, it is Ideal for large-scale workflows, it offers text/table extraction, OCR, and batch processing with sync/async endpoints.

Python
504
2 个月前

DocNET is as fast PDF editing and reading library for modern .NET applications

C#
497
1 年前

Handcrafted Go bindings for wkhtmltopdf and high-level HTML to PDF conversion interface

Go
267
1 个月前

Python library to interact with https://pdftables.com API

Python
86
1 年前

PHP library for converting the version of PDF files (for compatibility purposes).

PHP
68
3 年前

Turkish writings dataset that promotes creativity, content, composition, grammar, spelling and punctuation.

Jupyter Notebook
50
7 年前

does the magical pdf-to-pdf conversion popular in academic journal submission sites and does it faster than they do

Shell
30
6 年前

Convert HTML markup into beautiful PDF files using the famous wkhtmltopdf library.

PHP
19
7 年前

This repository contains some examples of using borb in google colab. These examples enable you to try out the features of borb without installing it on your system. They also ensure the system requirements and imports are all taken care of.

Jupyter Notebook
13
3 年前

Convert Thai constitution from PDF to plaintext and correct encoding glitches

HTML
9
7 年前

Convert Office file to PDF at scale

JavaScript
6
4 年前

Lightweight Helper classes based on iTextSharp for scaling and resizing Pdf Documents & Pages.

C#
4
1 年前