Repository navigation

#

web-archiving

ArchiveBox/ArchiveBox

🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...

Python
23667
1 个月前
Rhizome-Conifer/conifer

Collect and revisit web pages.

Python
1497
3 个月前

Core Python Web Archiving Toolkit for replay and recording of web archives

JavaScript
1491
1 天前

A High-Fidelity Web Archiving Extension for Chrome and Chromium based browsers!

TypeScript
988
3 个月前

Free web archiving and sharing service based on Cloudflare. 基于 Cloudflare 的免费网页归档和分享工具。

TypeScript
817
1 天前

Serverless replay of web archives directly in the browser

TypeScript
787
3 天前

CLI tool for saving a faithful copy of a complete web page in a single HTML file (based on SingleFile)

JavaScript
775
19 天前

Run a high-fidelity browser-based web archiving crawler in a single Docker container

TypeScript
753
4 天前

Automatically archive links to videos, images, and social media content from Google Sheets (and more).

Python
689
5 天前

InterPlanetary Wayback: A distributed and persistent archive replay system using IPFS

Python
629
1 个月前

Wayback Machine API interface & a command-line tool

Python
519
1 年前

Indelible links

JavaScript
467
3 天前

Webrecorder Player for Desktop (OSX/Windows/Linux). (Built with Electron + Webrecorder)

JavaScript
446
5 年前
JavaScript
443
6 年前

A Tool To Push Web Resources Into Web Archives

Python
419
1 年前

Streaming WARC/ARC library for fast web archive IO

Python
408
4 个月前

WarcDB: Web crawl data as SQLite databases.

Python
397
9 个月前

🐋 Web Archiving Integration Layer: One-Click User Instigated Preservation

Roff
372
1 个月前

Official ArchiveBox browser extension: automatically/manually preserve your browsing history using ArchiveBox.

JavaScript
302
1 个月前

Browsertrix is the hosted, high-fidelity, browser-based crawling service from Webrecorder designed to make web archiving easier and more accessible for all!

TypeScript
263
2 天前