Repository navigation

scrapy

Website
Wikipedia

Distributed web crawler admin platform for spiders management regardless of languages and frameworks. 分布式爬虫管理平台，支持任何语言和框架

webcrawler scrapy crawlab spiders-management Go scrapyd-ui spider 爬虫 webspider web-crawler Docker platform crawling-tasks

11995

1871

6 天前

lining0806 / PythonSpiderNotes

Python入门网络爬虫之精华版

Python zhihu captcha WeChat scrapy Selenium cookie

Python

7311

2169

4 年前

chyroc / WechatSogou

基于搜狗微信搜索的微信公众号爬虫接口

WeChat sogou Python 爬虫 pypi scrapy

Python

6126

1706

2 年前

rmax / scrapy-redis

Redis-based components for Scrapy.

scrapy 爬虫 distributed Redis

Python

5638

1583

1 年前

SpiderClub / haipproxy

💖 High available distributed ip proxy pool, powerd by Scrapy and Redis

high-availability scrapy ipproxy distributed Redis 爬虫 scheduler spider

Python

5491

908

3 年前

xiyouMc / WebHubBot

Python + Scrapy + MongoDB . 5 million data per day !!!💥 The world's largest website.

scrapy MongoDB Python

Python

5424

1501

6 年前

DropsDevopsOrg / ECommerceCrawlers

实战🐍多种网站、电商数据爬虫🕷。包含🕸：淘宝商品、微信公众号、大众点评、企查查、招聘网站、闲鱼、阿里任务、博客园、微博、百度贴吧、豆瓣电影、包图网、全景网、豆瓣音乐、某省药监局、搜狐新闻、机器学习文本采集、fofa资产采集、汽车之家、国家统计局、百度关键词收录数、蜘蛛泛目录、今日头条、豆瓣影评、携程、小米应用商店、安居客、途家民宿❤️❤️❤️。微信爬虫展示项目:

Python 爬虫 baidu-tieba taobao-spider dazhong-spider douban-movie douban-music alitask baotu quanjing fofa WeChat baidu scrapy

Python

5252

1414

1 年前

nghuyong / WeiboSpider

持续维护的新浪微博采集工具🚀🚀🚀

scrapy Python weibo weibospider

Python

3945

841

1 个月前

Gerapy / Gerapy

Distributed Crawler Management Framework Based on Scrapy, Scrapyd, Django and Vue.js

scrapy distributed webspider scrapyd dashboard spider Django Vue.js Docker

Python

3481

645

1 年前

Boris-code / feapder

🚀🚀🚀feapder is an easy to use, powerful crawler framework | feapder是一款上手简单，功能强大的Python爬虫框架。内置AirSpider、Spider、TaskSpider、BatchSpider四种爬虫解决不同场景的需求。且支持断点续爬、监控报警、浏览器渲染、海量数据去重等功能。更有功能强大的爬虫管理系统feaplat为其提供方便的部署及调度

scrapy spider 爬虫 Python

Python

3415

517

7 个月前

my8100 / scrapydweb

Web app for Scrapyd cluster management, Scrapy log analysis & visualization, Auto packaging, Timer tasks, Monitor & Alert, and Mobile UI. Docs 文档 👉

scrapy scrapyd scrapyd-ui scrapyd-api scrapyd-admin scrapyd-manage log-parsing Logging scrapyd-monitor scrapyd-keeper dashboard spider

Python

3335

582

8 个月前

wkunzhi / Python3-Spider

Python爬虫实战 - 模拟登陆各大网站包含但不限于：滑块验证、拼多多、美团、百度、bilibili、大众点评、淘宝，如果喜欢请start ❤️

scrapy Python crawl 爬虫 geek spider taobao dianping meituan Selenium pyppeteer splash

Python

3271

1033

2 年前

scrapy-plugins / scrapy-splash

Scrapy+Splash for JavaScript integration

scrapy headless-browsers

Python

3227

456

8 个月前

CodeRayZhang / Movie_Recommend

基于Spark的电影推荐系统，包含爬虫项目、web网站、后台管理系统以及spark推荐系统

spark-mllib spark-streaming ssm-maven scrapy Scala hadoop nginx hive MySQL

Java

2969

1051

7 年前

DormyMo / SpiderKeeper

admin ui for scrapy/open source scrapinghub

scrapy dashboard scrapyd scrapyd-ui spider

Python

2769

502

2 年前

QianyanTech / Image-Downloader

Download images from Google, Bing, Baidu. 谷歌、百度、必应图片下载.

google-images bing baidu Google pyqt scrapy spider

Python

2308

575

1 年前

librauee / Reptile

🏀 Python3 网络爬虫实战（部分含详细教程）猫眼腾讯视频豆瓣研招网微博笔趣阁小说百度热点 B站 CSDN 网易云阅读阿里文学百度股票今日头条微信公众号网易云音乐拉勾有道 unsplash 实习僧汽车之家英雄联盟盒子大众点评链家 LPL赛程台风梦幻西游、阴阳师藏宝阁天气牛客网百度文库睡前故事知乎 Wish

Python requests scrapy spider

Python

1696

512

4 年前