Repository navigation

#

xpath

Agent for collecting, processing, aggregating, and writing metrics, logs, and other arbitrary data.

Go
16105
7 小时前
jhy/jsoup

jsoup: the Java HTML parser, built for HTML editing, cleaning, scraping, and XSS safety.

Java
11233
13 小时前

新一代爬虫平台,以图形化方式定义爬虫流程,不写代码即可完成爬虫。

Java
10780
2 年前
D4Vinci/Scrapling

🕷️ An undetectable, powerful, flexible, high-performance Python library to make Web Scraping Easy and Effortless as it should be!

Python
6451
3 天前

Light-weight, simple and fast XML parser for C++ with XPath support

C++
4334
3 个月前

Html Agility Pack (HAP) is a free and open-source HTML parser written in C# to read/write DOM and supports plain XPATH or XSLT. It is a .NET code library that allows you to parse "out of the web" HTML files.

C#
2781
1 个月前

A sensible way to deal with XML & HTML for iOS & macOS

Objective-C
2604
5 年前

parse and generate XML easily in go

Go
1616
4 个月前

Parsel lets you extract data from XML/HTML documents using XPath or CSS selectors

Python
1262
23 天前

基于appium的app自动遍历工具

Scala
1216
2 年前

A fast & lightweight XML & HTML parser in Swift with XPath & CSS support

Swift
1098
1 年前

Command-line XML and HTML beautifier and content extractor

Go
990
8 天前
JavaScript
913
4 个月前

Command line tool to download and extract data from HTML/XML pages or JSON-APIs, using CSS, XPath 3.0, XQuery 3.0, JSONiq or pattern matching. It can also create new or transformed XML/HTML/JSON documents.

Pascal
812
6 个月前

豆瓣电影top250、斗鱼爬取json数据以及爬取美女图片、淘宝、有缘、CrawlSpider爬取红娘网相亲人的部分基本信息以及红娘网分布式爬取和存储redis、爬虫小demo、Selenium、爬取多点、django开发接口、爬取有缘网信息、模拟知乎登录、模拟github登录、模拟图虫网登录、爬取多点商城整站数据、爬取微信公众号历史文章、爬取微信群或者微信好友分享的文章、itchat监听指定微信公众号分享的文章

Python
788
3 年前

htmlquery is golang XPath package for HTML query.

Go
769
2 个月前

BaseX Main Repository.

Java
724
9 小时前

XPath package for golang, supports HTML, XML, JSON document query and more

Go
723
2 天前