Repository navigation

#

html-parser

TypeScript
4639
3 天前

PostHTML is a tool to transform HTML/XML with JS plugins

JavaScript
2952
1 年前

Html Agility Pack (HAP) is a free and open-source HTML parser written in C# to read/write DOM and supports plain XPATH or XSLT. It is a .NET code library that allows you to parse "out of the web" HTML files.

C#
2792
1 天前

A React Native component which renders HTML content as native views

JavaScript
2728
1 年前

Kanna(鉋) is an XML/HTML parser for Swift.

Swift
2471
3 个月前
philss/floki

Floki is a simple HTML parser that enables search for nodes using CSS selectors.

Elixir
2129
1 个月前

A Flutter widget for rendering static html as Flutter widgets (Will render over 80 different html tags!)

Dart
1877
7 个月前

Fast C/C++ HTML 5 Parser. Using threads.

C
1700
9 个月前

Convert text with HTML tags, links, hashtags, mentions into NSAttributedString. Make them clickable with UILabel drop-in replacement.

Swift
1497
1 年前

Oga is an XML/HTML parser written in Ruby.

Ruby
1166
25 天前

A fast & lightweight XML & HTML parser in Swift with XPath & CSS support

Swift
1099
1 年前

A Kotlin-based testing/scraping/parsing library providing the ability to analyze and extract data from HTML (server & client-side rendered). It places particular emphasis on ease of use and a high level of readability by providing an intuitive DSL. It aims to be a testing lib, but can also be used to scrape websites in a convenient fashion.

Kotlin
855
8 个月前

Heuristic based boilerplate removal tool

Python
796
7 个月前

htmlquery is golang XPath package for HTML query.

Go
770
7 天前

Modest is a fast HTML renderer implemented as a pure C99 library with no outside dependencies.

C
758
4 年前

HTML as data

Clojure
672
1 个月前

Locally saves webpages to your hard disk with images, css, js & links as is.

Python
625
5 个月前

Dedoc is a library (service) for automate documents parsing and bringing to a uniform format. It automatically extracts content, logical structure, tables, and meta information from textual electronic documents. (Parse document; Document content extraction; Logical structure extraction; PDF parser; Scanned document parser; DOCX parser; HTML parser

Python
601
12 天前