Repository navigation

#

parser-library

Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks

Python
6666
2 个月前
postlight/parser

📜 Extract meaningful content from the chaos of a web page

JavaScript
5695
1 年前

Lark is a parsing toolkit for Python, built with a focus on ergonomics, performance and modularity.

Python
5458
16 天前
TypeScript
2663
1 小时前

Library to parse and work with the C++ AST

C++
1737
1 年前

Microsoft.Recognizers.Text provides recognition and resolution of numbers, units, date/time, etc. in multiple languages (ZH, EN, FR, ES, PT, DE, IT, TR, HI, NL. Partial support for JA, KO, AR, SV). Packages available at: https://www.nuget.org/profiles/Recognizers.Text, https://www.npmjs.com/~recognizers.text

C#
1724
6 个月前

Industrial-strength monadic parser combinator library

Haskell
951
8 天前

Portable Executable parsing library (from PE-bear)

C++
658
4 个月前

📖🔬☕ BioJava is an open-source project dedicated to providing a Java library for processing biological data.

Java
612
1 个月前

Open Source SCPI device library

C
538
6 个月前

A sane rich text parsing and styling library.

Java
457
4 年前

Ksoup is a lightweight Kotlin Multiplatform library for parsing HTML, extracting HTML tags, attributes, and text, and encoding and decoding HTML entities.

Kotlin
439
5 天前

竜 TatSu generates Python parsers from grammars in a variation of EBNF

Python
429
2 个月前

🔪 Strictly RFC 3986 compliant URI parsing and handling library written in C89; moved from SourceForge to GitHub

C
368
1 天前

A library to parse C/C++ source as AST

C++
353
2 个月前

Library for snippet annotations

Rust
327
12 天前