wget

ArchiveBox/ArchiveBox

🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...

Python
23667
1 month ago

Google Drive Public File Downloader when Curl/Wget Fails

Python
4599
8 months ago

Google Drive direct download of big files

Perl
942
2 years ago

Command line tool to download and extract data from HTML/XML pages or JSON-APIs, using CSS, XPath 3.0, XQuery 3.0, JSONiq or pattern matching. It can also create new or transformed XML/HTML/JSON documents.

Pascal
722
2 months ago

[Deprecated] Get (almost) original messages from google group archives. Your data is yours.

Shell
215
3 years ago

Perform various social engineering attacks using PHP, Apache, Ngrok 🦥

HTML
205
4 years ago

Create an EPUB from a list of URLs. Standing on the shoulders of Wget, Readability and Pandoc.

JavaScript
201
10 months ago

Download sequencing data and metadata from GSA, SRA, ENA, and DDBJ databases.

Shell
184
17 days ago

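Official authoritative data (Chinese): statistical yearbooks, statistical bulletins, internet industry reports, Ministry of Industry and Information Technology (MIIT) data, ICT reports, etc.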

Python
168
6 months ago

🎯 A command line download/upload tool with resume.

Rust
135
8 days ago

A simple command line utility to download a remote file, similar to wget. This is not intended to be a full-featured wget replacement but a simple tool to test a few Rust crates.

Rust
135
6 months ago

Shell
132
5 years ago

A bash script to spider a site, follow links, and fetch urls (with built-in filtering) into a generated text file.

Shell
130
3 years ago

Wget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication.

C
120
4 months ago

do things with nopaystation tsv and links automatically

Shell
120
4 years ago

A wget script for pillaging.

Shell
115
4 years ago