Puppeteer

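Puppeteer is a Node.js library that provides an API for controlling Chrome/Chromium browsers over the DevTools Protocol. It is mainly used for testing, automating interactions in web applications, capturing screenshots, and scraping web page data.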
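As a rough illustration of the screenshot and scraping use cases, the sketch below opens a page, saves a full-page screenshot, and returns the page title. The helper name `captureTitle` and the `BrowserLike`/`PageLike` interfaces are hypothetical and exist only for this example. They mirror a small subset of Puppeteer's `Browser` and `Page` methods (`newPage`, `goto`, `screenshot`, `title`, `close`), so the block compiles without Puppeteer installed.

```typescript
// Minimal structural types covering only the calls used below.
// Assumption: these mirror a subset of Puppeteer's Page and Browser classes.
interface PageLike {
  goto(
    url: string,
    options?: { waitUntil?: "load" | "domcontentloaded" | "networkidle0" | "networkidle2" }
  ): Promise<unknown>;
  title(): Promise<string>;
  screenshot(options?: { path?: string; fullPage?: boolean }): Promise<unknown>;
}

interface BrowserLike {
  newPage(): Promise<PageLike>;
  close(): Promise<void>;
}

// Hypothetical helper: navigate to `url`, write a full-page screenshot to
// `screenshotPath`, and return the document title.
// Design choice for this sketch: the helper owns the browser's lifecycle
// and closes it even if navigation or the screenshot fails.
async function captureTitle(
  browser: BrowserLike,
  url: string,
  screenshotPath: string
): Promise<string> {
  const page = await browser.newPage();
  try {
    // Wait until network activity has mostly settled before capturing.
    await page.goto(url, { waitUntil: "networkidle2" });
    await page.screenshot({ path: screenshotPath, fullPage: true });
    return await page.title();
  } finally {
    await browser.close();
  }
}
```

With Puppeteer installed, a real browser from `await puppeteer.launch()` could be passed as the first argument, for example `captureTitle(browser, "https://example.com", "example.png")`. The URL and file name here are placeholders.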
Crawlee: a web scraping and browser automation library for Node.js, written for JavaScript and TypeScript, for building reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP, in both headful and headless mode, with proxy rotation.
Web Extension for saving a faithful copy of a complete web page in a single HTML file
Chrome extension that records your browser interactions and generates a Playwright or Puppeteer script.
Deploy headless browsers in Docker. Run on our cloud or bring your own. Free for non-commercial use.
Proxy server to bypass Cloudflare protection
A developer-friendly API for converting numerous document formats into PDF files, and more!
Lightpanda: the headless browser designed for AI and automation
Chrome DevTools for coding agents
Web page PDF/PNG rendering done right. Self-hosted service for rendering receipts, invoices, or any content.
💯 Teach puppeteer new tricks through plugins.
🚀 Venom by VYNECT™ is now part of the ERA CONNECT™ ecosystem, offering a freemium solution for ethical WhatsApp automation. Automate chats, simulate interactions, and send or receive media, with free usage limits and the option to upgrade to ERA CONNECT PRO for advanced features and stability.
Turn any webpage into structured data using LLMs
A Headless Chrome rendering solution
Pydoll is a library for automating Chromium-based browsers without a WebDriver, offering realistic interactions.
Analysis of bot protection systems with available countermeasures 🚿. How to defeat anti-bot systems 👻 and get around browser fingerprinting scripts 🕵️‍♂️ when scraping the web?
A command-line tool to turn web pages into readable PDF, EPUB, HTML, or Markdown docs.
Google Lighthouse for your entire site.