Repository navigation

#

webscraping

The Web Data API for AI - Turn entire websites into LLM-ready markdown or structured data 🔥

TypeScript
49321
3 小时前

Create agents that monitor and act on your behalf. Your agents are standing by!

Ruby
47088
1 天前
assafelovic/gpt-researcher

LLM based autonomous agent that conducts deep local and web research on any topic and generates a long report with citations.

Python
23081
3 天前
alirezamika/autoscraper
Python
6899
2 个月前
D4Vinci/Scrapling

🕷️ An undetectable, powerful, flexible, high-performance Python library to make Web Scraping Easy and Effortless as it should be!

Python
6451
3 天前

Analysis of Bot Protection systems with available countermeasures 🚿. How to defeat anti-bot system 👻 and get around browser fingerprinting scripts 🕵️‍♂️ when scraping the web?

JavaScript
4364
1 年前

Scrapoxy is a super proxies manager that orchestrates all your proxies into one place, rather than spreading management across multiple scrapers. It manages IP rotation and fingerprinting, and smartly routes traffic to avoid bans.

TypeScript
2342
7 天前

Web Scraper in Go, similar to BeautifulSoup

Go
2213
2 年前

A Powerful web scraper powered by LLM | OpenAI, Gemini & Ollama

Python
1748
9 天前

Vision utilities for web interaction agents 👀

Jupyter Notebook
1718
9 个月前

The web scraping open project repository aims to share knowledge and experiences about web scraping with Python

1667
1 年前

👻 Experimental library for scraping websites using OpenAI's GPT API.

Python
1441
2 个月前

LinkedIn enumeration tool to extract valid employee names from an organization through search engine scraping

Python
1416
9 个月前