#digipres

ArchiveBox/ArchiveBox

🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...

Python
23671
1 month ago

Repository of useful FFmpeg commands for archivists!

HTML
532
4 days ago

Official ArchiveBox browser extension: automatically/manually preserve your browsing history using ArchiveBox.

JavaScript
303
1 month ago

Desktop Electron app for ArchiveBox internet archiver. (ALPHA: not ready for general use)

JavaScript
179
2 years ago

List of open workflows and resources for A/V archiving

106
1 year ago

FileTrove indexes files and creates metadata from them.

Go
44
2 days ago

Bash scripts to manage LTO cartridges with LTFS

Shell
43
3 months ago

IFIscripts is an open-source digital preservation tool which facilitates collection management workflows within the IFI and further afield. It is freely available from the GitHub repository and subject to modification depending on the progressive needs of collections and based upon policies and preservation standards.

Python
28
2 months ago

Homebrew formula for the ArchiveBox self-hosted internet archiving solution.

Ruby
28
7 months ago

Engine for analysis of Siegfried export files and DROID CSV. The tool has three purposes: to break the export into its components and store them within a SQLite database; to create additional columns that augment the output where useful; and to query the SQLite database, outputting results in a readable form useful for analysis by researchers and archivists within digital preservation departments in memory institutions. The tool will find duplicates, unidentified files, blacklisted objects, character encoding issues, and more.

HTML
25
1 year ago

Digital Preservation of HTTP in documentary heritage.

Go
22
2 years ago

Official ArchiveBox MITM proxy: saves URLs of all requests passing through to an ArchiveBox server for archival.

Python
21
9 months ago

Collection of resources, papers, blog posts, and other documentation around working on and with Archivematica.

HTML
20
1 year ago

DigestBox takes any webpage URL (news article, video link, comment thread, etc.) and gives you just the raw content. It's powered by ArchiveBox.io under the hood.

HTML
19
1 year ago

Home of the official apt/deb package for Ubuntu/Debian-based systems.

Python
17
7 months ago

🧩 Proposal to allow user scripts like "expand comments", "hide popups", "fill out this form", etc. to be reusable across pure browser environments, puppeteer, playwright, extensions, AI tools, and many other contexts with minimal adjustment.

JavaScript
17
1 month ago

Source for the GitHub Wiki / ReadTheDocs documentation for ArchiveBox, the self-hosted internet archiving solution.

CSS
15
15 days ago

Official Python package for ArchiveBox, the self-hosted internet archiving solution.

13
7 months ago