dataengineering

This is a repo with links to everything you'd ever want to learn about data engineering

Jupyter Notebook
36975
1 day ago
open-metadata/OpenMetadata

OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.

TypeScript
7348
1 hour ago

Scalable and efficient data transformation framework - backwards compatible with dbt.

Python
2544
15 minutes ago

A free-to-use dbt package for creating and loading Data Vault 2.0 compliant data warehouses (powered by dbt, an open source data engineering tool and registered trademark of dbt Labs)

553
2 months ago

The developer framework for building analytical backends on top of ClickHouse, Redpanda and other high-performance analytical infrastructure

Rust
337
4 minutes ago

This repository provides various demos/examples of using Snowpark for Python.

Jupyter Notebook
283
1 year ago

An open source development framework to help you build data workflows and modern data architecture on AWS.

TypeScript
269
4 months ago

Data Engineering Pilipinas is a community for data engineers, data analysts, data scientists, developers, AI / ML engineers, and users of closed and open source data tools and methods / techniques in the Philippines. Data Engineering Pilipinas is a PyData group.

HTML
226
1 month ago

End-to-end data engineering project to get insights from PyPI using Python, DuckDB, MotherDuck & Evidence

Python
216
2 months ago

Everything I have ever been asked in interviews, plus other useful knowledge in a concise format

175
1 year ago

Index for online reading materials in order to learn Python and backend development/engineering concepts from scratch and develop a mastery sufficient for Senior/Principal Backend Engineers and Data Engineers

110
2 years ago