digipres
🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...
Official ArchiveBox browser extension: automatically/manually preserve your browsing history using ArchiveBox.
Desktop Electron app for ArchiveBox internet archiver. (ALPHA: not ready for general use)
List of open workflows and resources for A/V archiving
Home of the official Docker image for ArchiveBox
FileTrove indexes files and creates metadata from them.
IFIscripts is an open-source digital preservation tool that supports collection management workflows within the IFI and further afield. It is freely available from its GitHub repository and is modified as the needs of collections, policies, and preservation standards evolve.
Homebrew formula for the ArchiveBox self-hosted internet archiving solution.
Engine for analysis of Siegfried export files and DROID CSV. The tool has three purposes: to break the export into its components and store them in a SQLite database; to create additional columns that augment the output where useful; and to query the SQLite database, outputting results in a readable form for analysis by researchers and archivists in the digital preservation departments of memory institutions. The tool finds duplicates, unidentified files, blacklisted objects, character encoding issues, and more.
Official ArchiveBox MITM proxy: saves URLs of all requests passing through to an ArchiveBox server for archival.
Collection of resources, papers, blog posts, and other documentation around working on and with Archivematica.
DigestBox takes any webpage URL (news article, video link, comment thread, etc.) and gives you just the raw content. It's powered by ArchiveBox.io under the hood.
Home of the official apt/deb package for Ubuntu/Debian-based systems.
🧩 Proposal to allow user scripts like "expand comments", "hide popups", "fill out this form", etc. to be reusable across pure browser environments, puppeteer, playwright, extensions, AI tools, and many other contexts with minimal adjustment.
Official Python package for ArchiveBox, the self-hosted internet archiving solution.
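The Siegfried/DROID analysis engine listed above describes a load-then-query workflow: parse the export, store it in SQLite, and query for duplicates and unidentified files. The following is a minimal sketch of that idea, not that tool's actual implementation. The column names (`FILE_PATH`, `SIZE`, `HASH`, `PUID`), the sample rows, and the helper functions `load_export`, `find_duplicates`, and `find_unidentified` are all assumptions made for illustration; real DROID and Siegfried exports carry more columns and different layouts.

```python
import csv
import io
import sqlite3

# Hypothetical, simplified export. Real DROID/Siegfried exports have more columns.
SAMPLE_CSV = """FILE_PATH,SIZE,HASH,PUID
/a/report.pdf,1024,aaa111,fmt/18
/b/report-copy.pdf,1024,aaa111,fmt/18
/c/notes.txt,12,bbb222,x-fmt/111
/d/mystery.bin,99,ccc333,
"""


def load_export(csv_text):
    """Parse the CSV text and store one row per file in an in-memory SQLite DB."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE files (file_path TEXT, size INTEGER, hash TEXT, puid TEXT)"
    )
    rows = [
        # An empty PUID is stored as NULL to mark the file as unidentified.
        (r["FILE_PATH"], int(r["SIZE"]), r["HASH"], r["PUID"] or None)
        for r in csv.DictReader(io.StringIO(csv_text))
    ]
    conn.executemany("INSERT INTO files VALUES (?, ?, ?, ?)", rows)
    return conn


def find_duplicates(conn):
    """Return (hash, count) pairs for checksums shared by more than one file."""
    cur = conn.execute(
        "SELECT hash, COUNT(*) FROM files "
        "GROUP BY hash HAVING COUNT(*) > 1 ORDER BY hash"
    )
    return cur.fetchall()


def find_unidentified(conn):
    """Return paths of files that have no format identifier."""
    cur = conn.execute(
        "SELECT file_path FROM files WHERE puid IS NULL ORDER BY file_path"
    )
    return [row[0] for row in cur]


if __name__ == "__main__":
    conn = load_export(SAMPLE_CSV)
    print("Duplicates:", find_duplicates(conn))
    print("Unidentified:", find_unidentified(conn))
```

Keeping the data in SQLite means further reports (for example, counts per format identifier) can be added as extra queries without re-parsing the export.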