Repository navigation
scrape
- Website
- Wikipedia
An advanced Twitter scraping & OSINT tool written in Python that doesn't use Twitter's API, allowing you to scrape a user's followers, following, Tweets and more while evading most API limitations.
A Smart, Automatic, Fast and Lightweight Web Scraper for Python
Twitter API Scraper | Without an API key | Twitter Internal API | Free | Twitter scraper | Twitter Bot
A Python module to bypass Cloudflare's anti-bot page.
Get unified metadata from websites using Open Graph, Microdata, RDFa, Twitter Cards, JSON-LD, HTML, and more.
AnyCrawl 🚀: A Node.js/TypeScript crawler that turns websites into LLM-ready data and extracts structured SERP results from Google/Bing/Baidu/etc. Native multi-threading for bulk processing.
Implementation of X/Twitter v1, v2, and GraphQL APIs
A simple and unlimited twitter scraper : scrape tweets, likes, retweets, following, followers, user info, images...
scrapes medias, likes, followers, tags and all metadata. Inspired by instagram-php-scraper,bot
Scrape domain names from SSL certificates of arbitrary hosts
Advanced python library to scrap Twitter (tweets, users) from unofficial API
`scrape_linkedin` is a python package that allows you to scrape personal LinkedIn profiles & company pages - turning the data into structured json.
A webpage proxy that request through Chromium (puppeteer) - can be used to bypass Cloudflare anti bot / anti ddos on any application (like curl)
n8n node for browser automation using Puppeteer
Scrape Instagram's API with Puppeteer
Home Assistant custom component for scraping (html, xml or json) multiple values (from a single HTTP request) with a separate sensor/attribute for each value. Support for (login) form-submit functionality.
Crawl telegra.ph searching for nudes!
Scrape any website, article or RSS/Atom Feed with ease!