Repository navigation
elt
- Website
- Wikipedia
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
Apache Doris is an easy-to-use, high performance and unified analytics database.
dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.
SeaTunnel is a multimodal, high-performance, distributed, massive data integration tool.
🧙 Build, run, and manage data pipelines for integrating and transforming data.
Flink CDC is a streaming data integration tool
The open source ELT framework powered by Apache Arrow
Privacy and Security focused Segment-alternative, in Golang and React
data load tool (dlt) is an open source Python library that makes data loading easy 🛠️
Maestro: Netflix’s Workflow Orchestrator
A system for agentic LLM-powered data processing and ETL
Scalable and efficient data transformation framework - backwards compatible with dbt.
Open-source BI for engineers
Meltano: the declarative code-first data integration engine that powers your wildest data and ML-powered product ideas. Say goodbye to writing, maintaining, and scaling your own API integrations.
Fastest open-source tool for replicating Databases to Data Lake in Open Table Formats like Apache Iceberg. ⚡ Efficient, quick and scalable data ingestion for real-time analytics. Supporting Postgres, MongoDB , MySQL and Oracle
Dataform is a framework for managing SQL based data operations in BigQuery
Kuwala is the no-code data platform for BI analysts and engineers enabling you to build powerful analytics workflows. We are set out to bring state-of-the-art data engineering tools you love, such as Airbyte, dbt, or Great Expectations together in one intuitive interface built with React Flow. In addition we provide third-party data into data science models and products with a focus on geospatial data. Currently, the following data connectors are available worldwide: a) High-resolution demographics data b) Point of Interests from Open Street Map c) Google Popular Times
Optimus is an easy-to-use, reliable, and performant workflow orchestrator for data transformation, data modeling, pipelines, and data quality management.
Sling is a CLI tool that extracts data from a source storage/database and loads it in a target storage/database.