#datawarehouse

DataLinkDC/dinky

Dinky is a real-time data development platform based on Apache Flink, enabling agile data development, deployment and operation.

Java
3523
10 hours ago

Postgres-native columnar storage extension

C
2978
6 months ago

Dozer is a real-time data movement tool that leverages CDC from various sources and moves data into various sinks.

Rust
1555
1 year ago

From data warehouse to user profiling, from data construction to data application

600
4 years ago

A free to use dbt package for creating and loading Data Vault 2.0 compliant Data Warehouses (powered by dbt, an open source data engineering tool, registered trademark of dbt Labs)

553
2 months ago

An open-source columnar data format designed for fast, real-time analytics on big data.

Java
453
3 years ago

Free and open source schema versioning and database migration made natively with .NET/6. NEW THIS MAY 2022! v1.3.15 released!

C#
422
1 year ago

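Hydra (nine-headed dragon): built for PB-scale knowledge-base data retrieval, intelligence systems, data platforms, and large-scale control and scheduling systems. Targets large-scale data collection, analysis, and intelligent data retrieval, using the implementation of a large-scale distributed crawler search engine as an example.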

Java
329
9 hours ago

A comprehensive guide to building a modern data warehouse with SQL Server, including ETL processes, data modeling, and analytics.

TSQL
278
4 months ago

Timeseries Anomaly detection and Root Cause Analysis on data in SQL data warehouses and databases

Python
231
3 years ago

Dataplane is an Airflow inspired unified data platform with additional data mesh and RPA capability to automate, schedule and design data pipelines and workflows. Dataplane is written in Golang with a React front end.

JavaScript
230
5 months ago

Service for bulk-loading data to databases with automatic schema management (Redshift, Snowflake, BigQuery, ClickHouse, Postgres, MySQL)

Go
186
5 days ago

All of my individual learning materials, documents, and notes from the process of getting the Coursera IBM Data Engineer Professional Certificate specialization are stored in this repository.

Jupyter Notebook
99
3 years ago

A curated list of awesome Online Analytical Processing databases, frameworks, resources and other awesomeness.

82
7 months ago

Accelerator to build a Microsoft Fabric modern data platform using pre-built reusable Fabric items and an orchestration ELT Framework

TSQL
80
17 days ago

Implementing an end-to-end tweet ETL/analysis pipeline.

Python
57
3 years ago