#webscraping

Create agents that monitor and act on your behalf. Your agents are standing by!

Ruby
45931
4 days ago

🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.

TypeScript
36279
5 hours ago
assafelovic/gpt-researcher

LLM-based autonomous agent that conducts deep local and web research on any topic and generates a long report with citations.

Python
21006
7 hours ago
Makefile
6970
4 months ago
alirezamika/autoscraper
Python
6723
6 months ago

Analysis of bot protection systems with available countermeasures 🚿. How to defeat anti-bot systems 👻 and get around browser fingerprinting scripts 🕵️‍♂️ when scraping the web.

JavaScript
4277
9 months ago

Pydoll is a library for automating Chromium-based browsers without a WebDriver, offering realistic interactions.

Python
3210
8 days ago
D4Vinci/Scrapling

🕷️ An undetectable, powerful, flexible, high-performance Python library that makes Web Scraping simple and easy again!

Python
2914
2 days ago

Scrapoxy is a super proxies manager that orchestrates all your proxies into one place, rather than spreading management across multiple scrapers. It manages IP rotation and fingerprinting, and smartly routes traffic to avoid bans.

TypeScript
2223
10 days ago

Web Scraper in Go, similar to BeautifulSoup

Go
2195
1 year ago

A powerful web scraper powered by LLMs | OpenAI, Gemini & Ollama

Python
1664
13 days ago

Vision utilities for web interaction agents 👀

Jupyter Notebook
1645
5 months ago

The web scraping open project repository aims to share knowledge and experiences about web scraping with Python

1613
1 year ago

👻 Experimental library for scraping websites using OpenAI's GPT API.

Python
1434
6 months ago

LinkedIn enumeration tool to extract valid employee names from an organization through search engine scraping

Python
1366
5 months ago