Repository navigation
ai-scraping
- Website
- Wikipedia
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
🕷️ An undetectable, powerful, flexible, high-performance Python library that makes Web Scraping simple and easy again!
A Powerful web scraper powered by LLM | OpenAI, Gemini & Ollama
Lightweight library for scraping web-sites with LLMs
🔥 This repository contains complete application examples, including websites and other projects, developed using Firecrawl.
⬇️ A simple all-in-one CLI tool to download EVERYTHING from a URL (like youtube-dl/yt-dlp, forum-dl, gallery-dl, simpler ArchiveBox). 🎭 Uses headless Chrome to get HTML, JS, CSS, images/video/audio/subtitles, PDFs, screenshots, article text, git repos, and more...
AI web scraper built with Crawl4AI for extracting structured leads data from websites.
How to guides on web-crawling or scraping
Python, Javascript, and Rust libraries for the Spider Cloud API.
[Mirror] Self-hosted abuse detection and rule enforcement against low-effort mass AI scraping and bots.
A CLI tool and REST API that converts web content to clean Markdown, bypassing anti-scraping measures using headless browsers. Perfect for AI/LLM applications
AI Webpage Analyzer** is a powerful API service that extracts only the visible text content from any given URL and analyzes it using the **Haroon AI API**. It intelligently removes hidden elements, scripts, and other unnecessary content, providing a clean dataset for AI-powered analysis based on user prompts.
AI tools to enhance productivity and automate web-scraping
This repository contains complete application examples, developed using Skrape.ai
TypeScript/Node.js SDK to easily interact with the skrape.ai API
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.