website-scraper

Download website to local directory (including all css, images, js, etc.)

JavaScript
1638
8 days ago

Website Cloner - Uses goroutines to clone websites to your computer within seconds.

Go
1616
5 hours ago

Uscrapper Vanta: Dive deeper into the web with this powerful open-source tool. Extract valuable insights with ease and efficiency, from both surface and deep web sources. Empower your data mining and analysis with Vanta's advanced capabilities. Fast, reliable, and user-friendly, Uscrapper Vanta is the ultimate choice for researchers and analysts.

Python
708
9 months ago

Plugin for website-scraper which returns html for dynamic websites using puppeteer

JavaScript
344
2 days ago

🕸 Generates RSS feeds of any website & serves to the web! Automatic scraping. Ready to use configs. Write your own. Rolling Docker releases for speedy updates.

Ruby
108
2 days ago

Wayback Machine Downloader. 🔥 Download your entire archived websites from the Internet Archive Wayback Machine.

C#
99
3 years ago

A server to collect & archive websites that also supports video downloads

TypeScript
86
3 years ago

ScrapeGPT is a RAG-based Telegram bot designed to scrape and analyze websites, then answer questions based on the scraped content. The bot utilizes Retrieval Augmented Generation and webscraping to return natural language answers to the user's queries.

Python
84
2 years ago

Python-based web crawling script with randomized intervals, user-agent rotation, and proxy server IP rotation to outsmart website bots and prevent blocking.

Python
82
2 months ago

Automatically curates and posts content to LinkedIn. It can optionally use web scraping to gather data, which is then fed to ChatGPT to craft engaging LinkedIn posts.

Python
82
4 months ago

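🚀 A one-stop solution for creating a digital avatar from chat history 💡 Fine-tune a large language model on your chat logs so that it picks up your personal style, then bind it to a chatbot to create your own digital avatar. Digital clone / digital avatar / digital immortality / LLM / chatbot / LoRA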

Python
81
2 hours ago

Plugin for website-scraper which returns html for dynamic websites using PhantomJS.

JavaScript
59
4 years ago

Scraper for https://marvelsnapzone.com to retrieve metadata of Marvel SNAP cards.

Python
26
1 year ago

JSON collection of scraped file extensions, along with their description and type, from FileInfo.com

Python
19
3 years ago

Now you can keep track of your followers from YouTube, Instagram and Twitter accounts - Followers scraper API on AWS serverless

TypeScript
19
3 years ago