Repository navigation

scrape

Website
Wikipedia

An advanced Twitter scraping & OSINT tool written in Python that doesn't use Twitter's API, allowing you to scrape a user's followers, following, Tweets and more while evading most API limitations.

OSINT X (Twitter)Python scrape tweets elasticsearch kibana scrape-followers scrape-likes scrape-following

Python

16226

2780

3 年前

alirezamika / autoscraper

A Smart, Automatic, Fast and Lightweight Web Scraper for Python

scraping scraper scrape webscraping 爬虫 web-scraping 人工智能 Python webautomation 自动化机器学习

Python

6982

711

4 个月前

d60 / twikit

Python Bot client scraper scraping search X (Twitter)wrapper twitter-api twitter-scraper scrape twitter-bot twitter-client twitter-internal-api x x-api tweepy python-web-scraper

Python

3578

417

3 个月前

Anorov / cloudflare-scrape

A Python module to bypass Cloudflare's anti-bot page.

Cloudflare anti-bot-page scrape scraping-websites

Python

3482

452

2 年前

microlinkhq / metascraper

Get unified metadata from websites using Open Graph, Microdata, RDFa, Twitter Cards, JSON-LD, HTML, and more.

metadata scrape Parsing

HTML

2568

182

10 天前

any4ai / AnyCrawl

AnyCrawl 🚀: A Node.js/TypeScript crawler that turns websites into LLM-ready data and extracts structured SERP results from Google/Bing/Baidu/etc. Native multi-threading for bulk processing.

aitools crawl scrape webscraper ai-scraping data html-to-markdown rag scraping

TypeScript

2327

229

5 天前

trevorhobenshield / twitter-api-client

Implementation of X/Twitter v1, v2, and GraphQL APIs

API 自动化 client scrape X (Twitter)async Bot search twitter-api x x-api twitter-bot twitter-scraper

Python

1845

245

1 年前

Altimis / Scweet

A simple and unlimited twitter scraper : scrape tweets, likes, retweets, following, followers, user info, images...

Selenium scraper scraping X (Twitter)tweets Python twitter-scraper scrape scrape-followers scrape-following scrape-likes

Python

1220

244

6 个月前

realsirjoe / instagram-scraper

scrapes medias, likes, followers, tags and all metadata. Inspired by instagram-php-scraper,bot

scrape Instagram igramscraper

Python

1127

110

6 年前

glebarez / cero

Scrape domain names from SSL certificates of arbitrary hosts

Reconnaissance websecurity TLS (Transport Layer Security)scrape

674

2 年前

markowanga / stweet

Advanced python library to scrap Twitter (tweets, users) from unofficial API

X (Twitter)API Python tweets unofficial search crawl scraper scrapper scrape twitter-api user users tweet scrap

Python

610

2 年前

austinoboyle / scrape-linkedin-selenium

`scrape_linkedin` is a python package that allows you to scrape personal LinkedIn profiles & company pages - turning the data into structured json.

Selenium linkedin scraping web-scraper web-scraping Python scrape scraper

HTML

503

166

3 年前

unixfox / pupflare

A webpage proxy that request through Chromium (puppeteer) - can be used to bypass Cloudflare anti bot / anti ddos on any application (like curl)

cloudflare-bypass Puppeteer Koa proxy Docker Cloudflare anti-bot-page scrape scraping-websites cloudflare-scrape Chromium

JavaScript

416

1 个月前

drudge / n8n-nodes-puppeteer

n8n node for browser automation using Puppeteer

browser Chromium n8n Puppeteer scrape screenshot pdf proxy-server scraping screenshots Script

TypeScript

411

6 天前

ScriptSmith / instamancer

Scrape Instagram's API with Puppeteer

Puppeteer Instagram instagram-api instagram-scraper data-mining scrape

TypeScript

406

3 年前

danieldotnl / ha-multiscrape

Home Assistant custom component for scraping (html, xml or json) multiple values (from a single HTTP request) with a separate sensor/attribute for each value. Support for (login) form-submit functionality.

scraper REST API sensor scraping scrape hacs home-assistant-custom Home Assistant

Python

373

16 小时前