Repository navigation

#

website-scraper

Download website to local directory (including all css, images, js, etc.)

JavaScript
1603
19 天前

Website Cloner - Utilizes powerful Go routines to clone websites to your computer within seconds.

Go
1508
5 天前

🕵️‍♂️ LinkedIn profile scraper returning structured profile data in JSON.

TypeScript
625
1 年前

Uscrapper Vanta: Dive deeper into the web with this powerful open-source tool. Extract valuable insights with ease and efficiency, from both surface and deep web sources. Empower your data mining and analysis with Vanta's advanced capabilities. Fast, reliable, and user-friendly, Uscrapper Vanta is the ultimate choice for researchers and analysts.

Python
585
5 个月前

Plugin for website-scraper which returns html for dynamic websites using puppeteer

JavaScript
332
6 天前

🕸 Generates RSS feeds of any website & serves to the web! Automatic scraping. Ready to use configs. Write your own. Rolling Docker releases for speedy updates.

Ruby
102
4 天前

Wayback Machine Downloader. 🔥 Download your entire archived websites from the Internet Archive Wayback Machine.

C#
97
3 年前

A server to collect & archive websites that also supports video downloads

TypeScript
86
2 年前

ScrapeGPT is a RAG-based Telegram bot designed to scrape and analyze websites, then answer questions based on the scraped content. The bot utilizes Retrieval Augmented Generation and webscraping to return natural language answers to the user's queries.

Python
83
1 年前

Python-based web crawling script with randomized intervals, user-agent rotation, and proxy server IP rotation to outsmart website bots and prevent blocking.

Python
78
9 个月前

Automatically curates and posts content to LinkedIn. It can optionally use web scraping to gather data, which is then fed to ChatGPT to craft engaging LinkedIn posts.

Python
74
6 天前

Plugin for website-scraper which returns html for dynamic websites using PhantomJS.

JavaScript
59
3 年前

Scraper for https://marvelsnapzone.com to retrieve metadata of Marvel SNAP cards.

Python
22
10 个月前

JSON collection of scraped file extensions, along with their description and type, from FileInfo.com

Python
18
2 年前

Now you can keep track of your followers from YouTube, Instagram and Twitter accounts - Followers scraper API on AWS serverless

TypeScript
18
2 年前

A spider to crawl webpages

Python
16
5 年前