Repository navigation
Puppeteer

Puppeteer 是一个 Node.js 库,它提供了通过 DevTools 协议控制 Chrome/Chromium 浏览器的 API。 主要用于测试、Web 应用程序中的交互自动化、截取屏幕截图和爬取网页数据等场景。
Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.
Web Extension for saving a faithful copy of a complete web page in a single HTML file
Chrome extension that records your browser interactions and generates a Playwright or Puppeteer script.
An AI web browsing framework focused on simplicity and extensibility.
Deploy headless browsers in Docker. Run on our cloud or bring your own. Free for non-commercial uses.
Proxy server to bypass Cloudflare protection
A developer-friendly API for converting numerous document formats into PDF files, and more!
Lightpanda: the headless browser designed for AI and automation
Web page PDF/PNG rendering done right. Self-hosted service for rendering receipts, invoices, or any content.
💯 Teach puppeteer new tricks through plugins.
Venom is a high-performance system developed with JavaScript to create a bot for WhatsApp, support for creating any interaction, such as customer service, media sending, sentence recognition based on artificial intelligence and all types of design architecture for WhatsApp.
A Headless Chrome rendering solution
Turn any webpage into structured data using LLMs
A command-line tool to turn web pages into readable PDF, EPUB, HTML, or Markdown docs.
Analysis of Bot Protection systems with available countermeasures 🚿. How to defeat anti-bot system 👻 and get around browser fingerprinting scripts 🕵️♂️ when scraping the web?
Scan your entire site with Google Lighthouse in 2 minutes (on average). Open source, fully configurable with minimal setup.
Headless Chrome .NET API