Repository navigation

#

unstructured-data

Zipstack/unstract

No-code LLM Platform to launch APIs and ETL Pipelines to structure unstructured documents

Python
5064
3 天前

🔮 Instill Core is a full-stack AI infrastructure tool for data, model and pipeline orchestration, designed to streamline every aspect of building versatile AI-first applications

Makefile
2239
3 小时前

Dealing with all unstructured data, such as reverse image search, audio search, molecular search, video analysis, question and answer systems, NLP, etc.

HTML
2089
3 天前

Interact, analyze and structure massive text, image, embedding, audio and video datasets

Python
1636
24 天前

A multi-modal vector database that supports upserts and vector queries using unified SQL (MySQL-Compatible) on structured and unstructured data, while meeting the requirements of high concurrency and ultra-low latency.

Java
1400
2 天前

LOTUS: A semantic query engine for fast and easy LLM-powered data processing

Python
1155
13 小时前

Fast and efficient unstructured data extraction. Written in Rust with bindings for many languages.

Rust
1061
4 个月前

Visual Data Transformation and Data Preparation. Low-Code Python-based ETL.

TypeScript
1045
13 小时前

Enterprise-grade and API-first LLM workspace for unstructured documents, including data extraction, redaction, rights management, prompt playground, and more!

Python
837
15 小时前

Embedding Studio is a framework which allows you transform your Vector Database into a feature-rich Search Engine.

Python
380
1 年前

python implementation of jordansissel's grok regular expression library

Python
277
1 年前