Repository navigation

#

ai-scraping

🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.

TypeScript
36340
17 小时前
D4Vinci/Scrapling

🕷️ An undetectable, powerful, flexible, high-performance Python library that makes Web Scraping simple and easy again!

Python
2919
3 天前

A Powerful web scraper powered by LLM | OpenAI, Gemini & Ollama

Python
1665
14 天前

🔥 This repository contains complete application examples, including websites and other projects, developed using Firecrawl.

TypeScript
309
10 天前

⬇️ A simple all-in-one CLI tool to download EVERYTHING from a URL (like youtube-dl/yt-dlp, forum-dl, gallery-dl, simpler ArchiveBox). 🎭 Uses headless Chrome to get HTML, JS, CSS, images/video/audio/subtitles, PDFs, screenshots, article text, git repos, and more...

JavaScript
73
4 个月前

AI web scraper built with Crawl4AI for extracting structured leads data from websites.

Python
23
2 个月前

Python, Javascript, and Rust libraries for the Spider Cloud API.

Python
12
1 个月前

[Mirror] Self-hosted abuse detection and rule enforcement against low-effort mass AI scraping and bots.

Go
5
7 小时前

A CLI tool and REST API that converts web content to clean Markdown, bypassing anti-scraping measures using headless browsers. Perfect for AI/LLM applications

Go
2
3 个月前

AI Webpage Analyzer** is a powerful API service that extracts only the visible text content from any given URL and analyzes it using the **Haroon AI API**. It intelligently removes hidden elements, scripts, and other unnecessary content, providing a clean dataset for AI-powered analysis based on user prompts.

1
2 个月前

AI tools to enhance productivity and automate web-scraping

Jupyter Notebook
0
5 个月前

This repository contains complete application examples, developed using Skrape.ai

0
3 个月前

TypeScript/Node.js SDK to easily interact with the skrape.ai API

TypeScript
0
3 个月前

🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.

TypeScript
0
2 个月前