Repository navigation

#

wget

ArchiveBox/ArchiveBox

🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...

Python
25053
4 个月前

Google Drive Public File Downloader when Curl/Wget Fails

Python
4897
2 个月前

vet is a command-line tool that acts as a safety net for the risky curl | bash pattern. It lets you inspect, diff against previous versions, and lint remote scripts before asking for your explicit approval to execute. Promoting a safer, more transparent way to handle remote code execution.

Shell
959
1 个月前

Google Drive direct download of big files

Perl
941
2 年前

Command line tool to download and extract data from HTML/XML pages or JSON-APIs, using CSS, XPath 3.0, XQuery 3.0, JSONiq or pattern matching. It can also create new or transformed XML/HTML/JSON documents.

Pascal
816
7 个月前

[Deprecated] Get (almost) original messages from google group archives. Your data is yours.

Shell
218
4 年前

Download sequencing data and metadata from GSA, SRA, ENA, and DDBJ databases.

Shell
214
1 个月前

Create an EPUB from a list of URLs. Standing on the shoulders of Wget, Readability and Pandoc.

JavaScript
205
13 天前

Perform various social engineering attacks using PHP, Apache, Ngrok 🦥

HTML
204
4 年前

官方权威数据:统计年签,统计公报,互联网行业报告,工信部数据,ICT报告等 Official authoritative data (Chinese)

Python
177
3 个月前

🎯 A command line download/upload tool with resume.

Rust
159
9 天前
Shell
135
5 年前

A simple command line utility to download a remote file, similar to wget. This is not intended to be a full feature wget replacement but a simple tool to test few Rust crates.

Rust
134
1 年前

do things with nopaystation tsv and links automatically

Shell
133
5 年前

A bash script to spider a site, follow links, and fetch urls (with built-in filtering) into a generated text file.

Shell
131
4 年前

Wget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication.

C
130
1 个月前