Repository navigation

#

elt

The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.

Python
17901
1 小时前
Java
13527
6 小时前
dbt-labs/dbt-core

dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.

Python
10679
21 小时前

SeaTunnel is a next-generation super high-performance, distributed, massive data integration tool.

Java
8444
2 天前

data load tool (dlt) is an open source Python library that makes data loading easy 🛠️

Python
3485
1 小时前
quarylabs/quary
Rust
2303
2 个月前

Scalable and efficient data transformation framework - backwards compatible with dbt.

Python
2249
5 小时前

Meltano: the declarative code-first data integration engine that powers your wildest data and ML-powered product ideas. Say goodbye to writing, maintaining, and scaling your own API integrations.

Python
2031
1 天前

A system for agentic LLM-powered data processing and ETL

Python
1762
14 小时前

Dataform is a framework for managing SQL based data operations in BigQuery

TypeScript
898
1 天前

Fastest open-source tool for replicating Databases to Data Lake in Open Table Formats like Apache Iceberg. ⚡ Efficient, quick and scalable data ingestion for real-time analytics. Supporting Postgres, MongoDB and MySQL

Go
811
1 天前

Kuwala is the no-code data platform for BI analysts and engineers enabling you to build powerful analytics workflows. We are set out to bring state-of-the-art data engineering tools you love, such as Airbyte, dbt, or Great Expectations together in one intuitive interface built with React Flow. In addition we provide third-party data into data science models and products with a focus on geospatial data. Currently, the following data connectors are available worldwide: a) High-resolution demographics data b) Point of Interests from Open Street Map c) Google Popular Times

JavaScript
793
3 年前

Optimus is an easy-to-use, reliable, and performant workflow orchestrator for data transformation, data modeling, pipelines, and data quality management.

Go
749
10 个月前

Database replication platform that leverages change data capture. Stream production data from databases to your data warehouse (Snowflake, BigQuery, Redshift, Databricks) in real-time.

Go
648
1 天前

Sling is a CLI tool that extracts data from a source storage/database and loads it in a target storage/database.

Go
554
2 天前