Repository navigation

#

headless-chrome

A high-level browser automation library.

JavaScript
19863
1 年前
apify/crawlee

Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.

TypeScript
19685
10 小时前

🖥 Chrome automation made simple. Runs locally or headless on AWS Lambda.

TypeScript
13233
7 年前

Web page PDF/PNG rendering done right. Self-hosted service for rendering receipts, invoices, or any content.

HTML
7089
2 年前

Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with BeautifulSoup, Playwright, and raw HTTP. Both headful and headless mode. With proxy rotation.

Python
6757
1 天前
JavaScript
3647
11 天前

Headless chrome/chromium automation library (unofficial port of puppeteer)

Python
3569
4 年前

Puppeteer Pool, run a cluster of instances in parallel

TypeScript
3454
4 个月前

Puppeteer example scripts for running Headless Chrome from Node.

JavaScript
3050
5 年前
playwright-community/playwright-go

Playwright for Go a browser automation library to control Chromium, Firefox and WebKit with a single API.

Go
2946
13 天前

🤖 A Node queue API for generating PDFs using headless Chrome. Comes with a CLI, S3 storage and webhooks for notifying subscribers about generated PDFs

JavaScript
2636
2 年前
JavaScript
2262
10 天前

Run Lighthouse in CI, as a web service, using Docker. Pass/Fail GH pull requests.

JavaScript
2229
5 年前