Repository navigation

#

data-pipelines

Apache DolphinScheduler is the modern data orchestration platform. Agile to create high performance workflow with low-code

Java
13420
1 天前
StructuredLabs/preswald

Preswald is a framework for building and deploying interactive data apps, internal tools, and dashboards with Python. With one command, you can launch, share, and deploy locally or in the cloud, turning Python scripts into powerful shareable apps.

Python
3106
18 小时前
elementary-data/elementary

The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.

HTML
2048
2 天前

Meltano: the declarative code-first data integration engine that powers your wildest data and ML-powered product ideas. Say goodbye to writing, maintaining, and scaling your own API integrations.

Python
2031
1 天前

A system for agentic LLM-powered data processing and ETL

Python
1762
14 小时前
data-engineering-community/data-engineering-wiki

The best place to learn data engineering. Built and maintained by the data engineering community.

CSS
1650
12 天前
Scala
1516
5 个月前

Kickstart your MLOps initiative with a flexible, robust, and productive Python package.

Jupyter Notebook
1233
6 天前

Fast and efficient unstructured data extraction. Written in Rust with bindings for many languages.

Rust
1060
4 个月前

Visual Data Transformation and Data Preparation. Low-Code Python-based ETL.

TypeScript
1043
2 天前