parser-library

Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks

Python
6481
11 days ago
postlight/parser

📜 Extract meaningful content from the chaos of a web page

JavaScript
5588
9 months ago

Lark is a parsing toolkit for Python, built with a focus on ergonomics, performance and modularity.

Python
5208
1 month ago
TypeScript
2600
11 hours ago

Library to parse and work with the C++ AST

C++
1723
10 months ago

Microsoft.Recognizers.Text provides recognition and resolution of numbers, units, date/time, etc. in multiple languages (ZH, EN, FR, ES, PT, DE, IT, TR, HI, NL; partial support for JA, KO, AR, SV). Packages are available at https://www.nuget.org/profiles/Recognizers.Text and https://www.npmjs.com/~recognizers.text

C#
1709
2 months ago

Industrial-strength monadic parser combinator library

Haskell
941
3 months ago

Portable Executable parsing library (from PE-bear)

C++
658
2 days ago

📖🔬☕ BioJava is an open-source project dedicated to providing a Java library for processing biological data.

Java
605
1 month ago

Open Source SCPI device library

C
507
2 months ago

A sane rich text parsing and styling library.

Java
455
3 years ago

竜 TatSu generates Python parsers from grammars in a variation of EBNF

Python
421
3 months ago

Ksoup is a lightweight Kotlin Multiplatform library for parsing HTML, extracting HTML tags, attributes, and text, and encoding and decoding HTML entities.

Kotlin
414
3 months ago

🔪 Strictly RFC 3986 compliant URI parsing and handling library written in C89; moved from SourceForge to GitHub

C
353
3 days ago

Library for snippet annotations

Rust
313
4 days ago

A library to parse C/C++ source as AST

C++
302
3 months ago