Repository navigation
extract
- Website
- Wikipedia
SwiftSoup: Pure Swift HTML Parser, with best of DOM, CSS, and jquery (Supports Linux, iOS, Mac, tvOS, watchOS)
PDFsam, a desktop application to split, merge, mix, rotate PDF files and extract pages
data load tool (dlt) is an open source Python library that makes data loading easy 🛠️
Database Subsetting and Relational Data Browsing Tool.
GUI and API library to work with Engine assets, serialized and bundle files
Document (PDF, Word, PPTX ...) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents. Remove PII. Convert any document or picture to structured JSON or Markdown
Reversing Google's 3D satellite mode
This extension is now maintained in the Microsoft fork.
A library to read, parse, export and make subsets of different types of font files.
To extract main article from given URL with Node.js
A web interface to extract tabular data from PDFs
Converts a pdf file into a text file while keeping the layout of the original pdf. Useful to extract the content from a table in a pdf file for instance. This is a subclass of PDFTextStripper class (from the Apache PDFBox library).
The extension provides refactoring tools for your React codebase
A tool to view and extract the contents of an Windows Installer (.msi) file.
Deobfuscate obfuscator.io, unminify and unpack bundled javascript
extrakto for tmux - quickly select, copy/insert/complete text without a mouse