Repository navigation

#

dataengineering

This is a repo with links to everything you'd ever want to learn about data engineering

Jupyter Notebook
38061
10 天前
open-metadata/OpenMetadata

OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.

TypeScript
7649
2 小时前

Scalable and efficient data transformation framework - backwards compatible with dbt.

Python
2633
1 天前

A free to use dbt package for creating and loading Data Vault 2.0 compliant Data Warehouses (powered by dbt, an open source data engineering tool, registered trademark of dbt Labs)

557
18 天前

The developer framework for building analytical backends on top of ClickHouse, Redpanda and other high-performance analytical infrastructure

Rust
439
1 天前

This repository provides various demos/examples of using Snowpark for Python.

Jupyter Notebook
283
2 年前

An open source development framework to help you build data workflows and modern data architecture on AWS.

TypeScript
270
5 个月前

Data Engineering Pilipinas is a community for data engineers, data analysts, data scientists, developers, AI / ML engineers, and users of closed and open source data tools and methods / techniques in the Philippines. Data Engineering Pilipinas is a PyData group.

HTML
232
2 个月前

end-to-end data engineering project to get insights from PyPi using python, duckdb, MotherDuck & Evidence

Python
224
3 个月前

Все, о чем меня когда-либо спрашивали на собеседованиях, и другие полезные знания в кратком формате

177
1 年前

Index for online reading materials in order to learn Python and backend development/engineering concepts from scratch and develop a mastery sufficient for Senior/Principal Backend Engineers and Data Engineers

110
2 年前