wayback-machine

ArchiveBox/ArchiveBox

🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...

Python
23671
1 month ago

Fetch known URLs from AlienVault's Open Threat Exchange, the Wayback Machine, and Common Crawl.

Go
4310
4 months ago

An archiving tool with an IM-style interface that prioritizes privacy and accessibility, integrated with various archival services including Internet Archive, archive.today, Ghostarchive, IPFS, Telegraph, and file systems.

Go
1944
5 hours ago

dessant/web-archives

Browser extension for viewing archived and cached versions of web pages, available for Chrome, Edge and Safari

JavaScript
1301
4 months ago

A collection of special paths linked to common sensitive APIs, devops internals, framework configs, known misconfigurations, juicy APIs, etc. It can be used as part of web content discovery to scan passively for high-quality endpoints and quick wins.

971
10 months ago

Serverless replay of web archives directly in the browser

TypeScript
788
4 days ago

Wayback Machine API interface & a command-line tool

Python
519
1 year ago

A command-line utility and Scrapy middleware for scraping time series data from Archive.org's Wayback Machine.

Python
442
1 year ago

Tools for fighting abuse on Twitter

Rust
425
2 years ago

Browse emulated browsers connected to old web sites in your browser!

JavaScript
305
6 months ago

A lightweight tool for scraping current and historic Google Analytics data

Python
207
8 months ago

Extract web archive data using Wayback Machine and Common Crawl

Go
155
6 months ago

JavaScript
155
1 year ago

Browser extension for quickly saving web pages to the Internet Archive's Wayback Machine.

JavaScript
152
2 years ago

A tool for appending URLs, skipping duplicates/paths, and combining parameters.

Go
121
3 years ago