Repository navigation

#

digipres

ArchiveBox/ArchiveBox

🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...

Python
24793
3 个月前

Repository of useful FFmpeg commands for archivists!

HTML
549
4 个月前

Official ArchiveBox browser extension: automatically/manually preserve your browsing history using ArchiveBox.

JavaScript
346
4 个月前

Desktop Electron app for ArchiveBox internet archiver. (ALPHA: not ready for general use)

JavaScript
181
2 年前

List of open workflows and resources for A/V archiving

107
1 年前

FileTrove indexes files and creates metadata from them.

Go
47
3 个月前

Bash scripts to manage LTO cartridges with LTFS

Shell
43
7 个月前

Official ArchiveBox MITM proxy: saves URLs of all requests passing through to an ArchiveBox server for archival.

Python
29
1 年前

IFIscripts is an open-source digital preservation tool which facilitates collection management workflows within the IFI and further afield. It is freely available from the GitHub repository and subject to modification depending on the progressive needs of collections and based upon policies and preservation standards.

Python
29
1 个月前

Homebrew formula for the ArchiveBox self-hosted internet archiving solution.

Ruby
28
10 个月前

Engine for analysis of Siegfried export files and DROID CSV. The tool has three purposes, break the export into its components and store them within a SQLite database; create additional columns to augment the output where useful; and query the SQLite database, outputting results in a readable form useful for analysis by researchers and archivists within digital preservation departments in memory institutions. The tool will find duplicates, unidentified files, blacklisted objects, character encoding issues, and more.

HTML
26
1 年前

Digital Preservation of HTTP in documentary heritage.

Go
22
2 年前

Collection of resources, papers, blog posts, and other documentation around working on and with Archivematica.

HTML
21
2 年前

🧩 Proposal to allow user scripts like "expand comments", "hide popups", "fill out this form", etc. to be reusable across pure browser environments, puppeteer, playwright, extensions, AI tools, and many other contexts with minimal adjustment.

JavaScript
19
1 个月前

DigestBox takes any webpage URL (news article, video link, comment thread, etc.) and gives you just the raw content. It's powered by ArchiveBox.io under the hood.

HTML
19
2 年前

Home of the official apt/deb package for Ubuntu/Debian-based systems.

Python
17
10 个月前

Source for the Github Wiki / ReadTheDocs documentation for AchiveBox, the self-hosted internet archiving solution.

CSS
15
19 天前

Official Python package for ArchiveBox, the self-hosted internet archiving solution.

13
10 个月前