Repository navigation

#

elt-pipeline

DataForge helps data teams write functional transformation pipelines by leveraging software engineering principles

PLpgSQL
54
2 天前

An end-to-end data pipeline which extracts divvy bikeshare data from web loads it into data lake and datawarehouse transforms it using dbt and finally , a dashboard to visualize the data using looker studio, the pipeline is orchestrated using prefect

Python
37
2 年前

A Pub/Sub for Tables based data integration platform, to discover, publish, modify and consume data effortlessly.

Python
34
19 天前

This project is an ETL / ELT Framework powered by DuckDB, designed to seamlessly integrate and process data from diverse sources. It leverages Markdown as a configuration medium, where YAML blocks define metadata for each data source, and embedded SQL blocks specify the extraction, transformation, and loading logic.

Go
23
6 天前

Decodes, packs, encodes, proves, stores and uploads block-replicas (primarily "block-specimens") produced by EVM or non-EVM byte code based blockchains.

Go
16
3 个月前

Modeling tool like DBT to use SQL Alchemy core with a DataFrame interface like

Python
11
2 年前

A modern data platform implemented on Azure Synapse Analytics using ELT Framework - https://github.com/bennyaustin/elt-framework. Data platform infrastructure provisioned using https://github.com/bennyaustin/iac-synapse-dataplatform

TSQL
9
1 年前

Public DBT instance to aid in data transformation for analytics purposes

Shell
7
1 天前

A data engineering project with dbt, Docker, Kestra, Terraform, GCP and Looker.

HCL
7
3 个月前

🌄📈📉 A Data Engineering Project 🌈 that implements an ELT data pipeline using Dagster, Docker, Dbt, Polars, Snowflake, PostgreSQL. Data from kaggle website 🔥

Python
5
1 年前

💻💛Fundamental Data Engineering Course 2024 Week4 Learn DBT Transform Data with Models, Macro, ELT-Pipeline with Dagster 🌎

Python
5
1 年前

This Repo contains all study, lab and supportive materials for Udemy course on "Google Cloud Professional Data Engineer - A Complete Guide".

Python
5
4 天前

🛸 This project showcases an Extract, Load, Transform (ELT) pipeline built with Python, Apache Spark, Delta Lake, and Docker. The objective of the project is to scrape UFO sighting data from NUFORC and process it through the Medallion architecture to create a star schema in the Gold layer that is ready for analysis.

Python
4
2 年前

🍺 A data engineering project showcasing an ELT pipeline using modern technologies such as Delta-rs, and Apache Airflow.

Python
4
2 年前
3
2 年前

A deep dive into North American grocery e-commerce behaviour based on Instacart's open dataset. [ELT, EDA, ML clustering]

Jupyter Notebook
2
1 年前

An end to end ELT project that uses data from the Zomato Restaurant, an Indian multinational restaurant aggregator and food delivery company. The project extracts data from Kaggle dataset, loads it into Snowflake tables, then is transformed and modelled in dbt Labs.

2
1 年前

Irish Property Price Register transformed into a data warehouse via an EtLT pipeline.

TypeScript
2
3 年前

This is an ELT data pipeline setup to track the activities of an e-commerce website based on orders, reviews, deliveries and shipment date. This project utilized technologies like Airflow, AWS RDS-Postgres, Python etc.

Python
2
1 年前