#xpath

Agent for collecting, processing, aggregating, and writing metrics, logs, and other arbitrary data.

Go
16309
1 day ago
jhy/jsoup

jsoup: the Java HTML parser, built for HTML editing, cleaning, scraping, and XSS safety.

Java
11254
3 days ago

A next-generation crawler platform that defines crawl workflows graphically, so a crawler can be built without writing code.

Java
10967
2 years ago
D4Vinci/Scrapling

🕷️ An undetectable, powerful, flexible, high-performance Python library to make Web Scraping Easy and Effortless as it should be!

Python
7418
3 days ago

Light-weight, simple and fast XML parser for C++ with XPath support

C++
4380
1 month ago

Html Agility Pack (HAP) is a free and open-source HTML parser written in C# to read/write DOM and supports plain XPATH or XSLT. It is a .NET code library that allows you to parse "out of the web" HTML files.

C#
2792
1 day ago

A sensible way to deal with XML & HTML for iOS & macOS

Objective-C
2600
5 years ago

Parse, query and modify XML easily in go

Go
1632
1 month ago

Parsel lets you extract data from XML/HTML documents using XPath or CSS selectors

Python
1273
20 days ago

An automated app traversal tool based on Appium

Scala
1221
2 years ago

A fast & lightweight XML & HTML parser in Swift with XPath & CSS support

Swift
1099
1 year ago

Command-line XML and HTML beautifier and content extractor

Go
1015
4 days ago
JavaScript
914
6 months ago

Command line tool to download and extract data from HTML/XML pages or JSON-APIs, using CSS, XPath 3.0, XQuery 3.0, JSONiq or pattern matching. It can also create new or transformed XML/HTML/JSON documents.

Pascal
815
7 months ago

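Douban Movie Top 250; scraping JSON data from Douyu as well as photos of attractive women; Taobao; Youyuan; using CrawlSpider to scrape partial basic profile information from the Hongniang matchmaking site, plus distributed crawling of Hongniang with storage in Redis; small crawler demos; Selenium; scraping Duodian; developing API endpoints with Django; scraping Youyuan profile information; simulated login to Zhihu; simulated login to GitHub; simulated login to Tuchong; scraping the entire Duodian store site; scraping the article history of WeChat official accounts; scraping articles shared in WeChat groups or by WeChat friends; using itchat to monitor articles shared by specified WeChat official accounts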

Python
786
3 years ago

htmlquery is golang XPath package for HTML query.

Go
770
7 days ago

BaseX Main Repository.

Java
729
2 days ago

XPath package for golang, supports HTML, XML, JSON document query and more

Go
727
2 months ago