Repository navigation

#

headless-chrome

TypeScript
91770
11 小时前

A high-level browser automation library.

JavaScript
19714
1 年前
apify/crawlee

Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.

TypeScript
18781
24 分钟前

🖥 Chrome automation made simple. Runs locally or headless on AWS Lambda.

TypeScript
13231
7 年前

Web page PDF/PNG rendering done right. Self-hosted service for rendering receipts, invoices, or any content.

HTML
7082
2 年前

Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with BeautifulSoup, Playwright, and raw HTTP. Both headful and headless mode. With proxy rotation.

Python
6181
5 小时前
JavaScript
3635
3 个月前

Headless chrome/chromium automation library (unofficial port of puppeteer)

Python
3572
4 年前

Puppeteer Pool, run a cluster of instances in parallel

TypeScript
3437
2 个月前

Puppeteer example scripts for running Headless Chrome from Node.

JavaScript
3054
5 年前
playwright-community/playwright-go

Playwright for Go a browser automation library to control Chromium, Firefox and WebKit with a single API.

Go
2858
3 个月前

🤖 A Node queue API for generating PDFs using headless Chrome. Comes with a CLI, S3 storage and webhooks for notifying subscribers about generated PDFs

JavaScript
2633
1 年前
JavaScript
2264
5 天前

Run Lighthouse in CI, as a web service, using Docker. Pass/Fail GH pull requests.

JavaScript
2230
5 年前