Repository navigation
structured-data
- Website
- Wikipedia
A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive visualization.
The AI framework that adds the engineering to prompt engineering (Python/TS/Ruby/Java/C#/Rust/Go compatible)
Schema.org - schemas and supporting software
Knowledge Agents and Management in the Cloud
TransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library for building modular, reusable, strongly typed machine learning workflows on Apache Spark with minimal hand-tuning
A multi-modal vector database that supports upserts and vector queries using unified SQL (MySQL-Compatible) on structured and unstructured data, while meeting the requirements of high concurrency and ultra-low latency.
A powerful Model Context Protocol (MCP) server that provides an all-in-one solution for public web access.
Get clean data from tricky documents, powered by vision-language models ⚡
Visual Data Preparation and Transformation. Low-Code Python-based ETL.
Extract and convert data from any document, images, pdfs, word doc, ppt or URL into multiple formats (Markdown, JSON, CSV, HTML) with intelligent structured data extraction and advanced OCR.
DeepTables: Deep-learning Toolkit for Tabular data
Schema.org objects turned into strongly typed C# POCO classes for use in .NET. All classes can be serialized into JSON/JSON-LD and XML, typically used to represent structured data in the head section of html page.
PandaPy has the speed of NumPy and the usability of Pandas 10x to 50x faster (by @firmai)
Machine Learning library for the web and Node.
LimiX: Unleashing Structured-Data Modeling Capability for Generalist Intelligence https://arxiv.org/abs/2509.03505
Identify hardcoded secrets in static structured text
All in One SEO plugin for WordPress SEO
Collection of structured data snippets in Google preferred JSON-LD format.