Repository navigation
ingestion
- Website
- Wikipedia
Replace 'hub' with 'ingest' in any GitHub URL to get a prompt-friendly extract of a codebase
Open Source Metering and Usage Based Billing API ⭐️ Consumption tracking, Subscription management, Pricing iterations, Payment orchestration & Revenue analytics
A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, organization and lifecycle management for both streaming and batch data ecosystems.
Highly Performant, Modular, Memory Safe and Production-ready Inference, Ingestion and Indexing built in Rust 🦀
OpenSearch Data Prepper is a component of the OpenSearch project that accepts, filters, transforms, enriches, and routes data at scale.
Enabling Continuous Data Processing with Apache Spark and Azure Event Hubs
Service for bulk-loading data to databases with automatic schema management (Redshift, Snowflake, BigQuery, ClickHouse, Postgres, MySQL)
✨ A extension can help you open git ingest to turn any git repository into a prompt-friendly text ingest for LLMs.
Use local files or public GitHub repository as a source and ask questions through ChatGPT about it
A free, open-source, web-based self-service BI tailor-made for clickhouse, google bigquery, mysql, postgresql, vertica
Apache Spark examples exclusively in Java
Ylem is an open-source platform for real-time data streaming orchestration
IBIS is a workflow creation-engine that abstracts the Hadoop internals of ingesting RDBMS data.
Extensible streaming ingestion pipeline on top of Apache Spark
Python script for ingesting various files into a semantic graph. For text, images, cpp, python, rust, javascript, and PDFs.