Repository navigation

apify

Website
Wikipedia

Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.

web-scraping web-crawling npm headless-chrome Puppeteer 自动化 apify scraping crawling 爬虫 headless scraper web-crawler JavaScript Node.js Playwright TypeScript

TypeScript

19685

1018

10 小时前

apify / crawlee-python

Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with BeautifulSoup, Playwright, and raw HTTP. Both headful and headless mode. With proxy rotation.

apify 自动化 beautifulsoup 爬虫 crawling headless headless-chrome pip Playwright Python scraper scraping web-crawler web-crawling web-scraping Hacktoberfest

Python

6757

482

1 天前

apify / apify-sdk-js

Apify SDK monorepo

actor apify JavaScript Node.js SDK TypeScript

TypeScript

158

1 天前

apify / apify-cli

Apify command-line interface helps you create, develop, build and run Apify actors, and manage the Apify cloud platform.

命令行界面 headless-chrome Puppeteer apify Hacktoberfest

TypeScript

157

19 小时前

apify / apify-sdk-python

The Apify SDK for Python is the official library for creating Apify Actors in Python. It provides useful features like actor lifecycle management, local storage emulation, and actor event handling.

apify 自动化 Python scraping SDK

Python

146

1 天前

apify / actor-scraper

House of Apify Scrapers. Generic scraping actors with a simple UI to handle complex web crawling and scraping use cases.

web-scraping apify

JavaScript

127

2 年前

superryeti / Hands-on-WebScraping

This repo is a part of blog series on several web scraping projects where we will explore scraping techniques to crawl data from simple websites to websites using advanced protection.

Python Node.js scrapy Puppeteer apify requests 爬虫

Python

2 年前

apify / apify-client-python

Apify API client for Python

API apify client Python scraping

Python

2 天前

VaclavRut / actor-amazon-crawler

Amazon crawler - this configuration will extract items for a keywords that you will specify in the input, and it will automatically extract all pages for the given keyword. You can specify more keywords on the input for one run.

apify

JavaScript

5 年前

maxCopell / tripadvisor-scraper

Scrape Tripadvisor restaurant, hotels, and places.

apify scraper

JavaScript

3 年前

MrXujiang / crawel

基于Apify+node+react搭建的有点意思的爬虫平台

Node.js Puppeteer 爬虫 apify React react-hooks umi

JavaScript

5 年前

apify / super-scraper

Generic REST API for scraping websites. Drop-in replacement for ScrapingBee, ScrapingAnt, and ScraperAPI services. And it is open-source!

API scraping apify cheerio JavaScript Node.js Playwright TypeScript web-scraping

TypeScript

10 个月前

JuroOravec / crawlee-one

Professional scrapers that provide full control to the users. Crawlee One builds on top of Crawlee and Apify and extends them with features for robust and highly configurable web scrapers.

actor apify 爬虫框架 scraper scraping Web

TypeScript

1 年前

bernardro / actor-youtube-scraper

Apify actor to scrape Youtube search results. You can set the maximum videos to scrape per page as well as the date from which to start scraping.

apify 爬虫 search YouTube

JavaScript

3 年前

apify / actor-content-checker

You can use this act to monitor any page's content and get a notification when content changes.

apify web-scraping

JavaScript

3 年前

sauermar / web-browser-recorder

Web application for recording, management and editing of inteligent RPA workflows using Playwright technology

Playwright React browser 自动化 apify material-ui TypeScript user-friendly

TypeScript

3 年前

metalwarrior665 / actor-google-sheets

No more dealing with Google API. Simple Node.js program to automate access to Google Sheets.

Spreadsheet apify Google Sheets

JavaScript

3 年前

pocesar / actor-shopify-scraper

Automate monitoring prices on the most popular solution for building online stores and selling products online. Crawl arbitrary Shopify-powered online stores and extract a list of all products in a structured form, including product title, price, description, etc

apify scraper scraping JavaScript shopify

JavaScript

2 年前