Repository navigation

#

structured-data

A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive visualization.

Python
16086
36 分钟前
Rust
6215
1 天前

Schema.org - schemas and supporting software

HTML
5750
3 天前

学习C & C++ & python&汇编语言 LLVM编译器 数据结构 算法 操作系统 单片机 linux 面试

C
2999
1 年前

TransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library for building modular, reusable, strongly typed machine learning workflows on Apache Spark with minimal hand-tuning

Scala
2270
2 年前

A multi-modal vector database that supports upserts and vector queries using unified SQL (MySQL-Compatible) on structured and unstructured data, while meeting the requirements of high concurrency and ultra-low latency.

Java
1684
5 天前
JavaScript
1389
10 天前

Visual Data Preparation and Transformation. Low-Code Python-based ETL.

JavaScript
1107
3 天前

Extract and convert data from any document, images, pdfs, word doc, ppt or URL into multiple formats (Markdown, JSON, CSV, HTML) with intelligent structured data extraction and advanced OCR.

Python
696
24 天前

Schema.org objects turned into strongly typed C# POCO classes for use in .NET. All classes can be serialized into JSON/JSON-LD and XML, typically used to represent structured data in the head section of html page.

C#
674
10 天前

PandaPy has the speed of NumPy and the usability of Pandas 10x to 50x faster (by @firmai)

Python
547
4 年前

LimiX: Unleashing Structured-Data Modeling Capability for Generalist Intelligence https://arxiv.org/abs/2509.03505

Python
530
13 天前