Repository navigation

#

scraping-framework

Lightweight web scraping toolkit for documents and structured data.

Python
314
2 年前
Python
311
1 年前

Sneakpeek is a framework that helps to quickly and conviniently develop scrapers. It’s the best choice for scrapers that have some specific complex scraping logic that needs to be run on a constant basis

Python
37
2 年前

Serritor is an open source web crawler framework built upon Selenium and written in Java. It can be used to crawl dynamic web pages that require JavaScript to render data.

Java
32
3 年前

An API wrapper for Scrappey.com written in Python (cloudflare, datadome bypass & solver)

Python
21
1 个月前

ProxyCrawl PHP library for scraping and crawling websites

PHP
21
2 年前

An API wrapper for Scrappey.com written in Node.js (cloudflare bypass & solver)

JavaScript
12
2 年前

A simple, easy to use, scalable scraping framework written in PHP

PHP
10
5 年前

Web scraping API to outsource tons of GET & xpath to cloud computing

Python
7
1 年前

A powerful web scraping library built with Playwright that provides a declarative, step-by-step approach to web automation and data extraction.

TypeScript
4
5 天前

M.A. Thesis work, news scraping framework/pipeline using python, beautifulsoup, newspaper3k, flask and mongodb with a custom api.

Python
3
7 年前