wayback-machine

ArchiveBox/ArchiveBox

🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...

Python
24794
3 months ago

Fetch known URLs from AlienVault's Open Threat Exchange, the Wayback Machine, and Common Crawl.

Go
4518
8 months ago

An archiving tool with an IM-style interface that prioritizes privacy and accessibility, integrated with various archival services including Internet Archive, archive.today, Ghostarchive, IPFS, Telegraph, and file systems.

Go
2046
4 hours ago

dessant/web-archives

Browser extension for viewing archived and cached versions of web pages, available for Chrome, Edge and Safari

JavaScript
1379
8 months ago

A collection of special paths linked to common sensitive APIs, devops internals, framework configs, known misconfigurations, juicy APIs, etc. It can be used as part of web content discovery to scan passively for high-quality endpoints and quick wins.

986
1 year ago

Serverless replay of web archives directly in the browser

TypeScript
826
1 month ago

Wayback Machine API interface & a command-line tool

Python
544
1 year ago

A command-line utility and Scrapy middleware for scraping time series data from Archive.org's Wayback Machine.

Python
453
1 year ago

Tools for fighting abuse on Twitter

Rust
429
2 years ago

Browse emulated browsers connected to old web sites in your browser!

JavaScript
328
10 months ago

A lightweight tool for scraping current and historic Google Analytics data

Python
218
1 year ago

Extract web archive data using Wayback Machine and Common Crawl

Go
159
10 months ago

JavaScript
158
2 years ago

Browser extension for quickly saving web pages to the Internet Archive's Wayback Machine.

JavaScript
155
2 years ago

Extracts URLs from OSINT Archives for Security Insights

Rust
154
11 hours ago