Repository navigation

#

hudi

Groovy
14118
2 小时前

The world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly any scenario, StarRocks provides best-in-class performance for multi-dimensional analytics, real-time analytics, and ad-hoc queries. A Linux Foundation project.

Java
10511
1 小时前
Java
5916
2 小时前

大数据知识仓库涉及到数据仓库建模、实时计算、大数据、数据中台、系统设计、Java、算法等。

Shell
1654
3 个月前

Apache Amoro(incubating) is a Lakehouse management system built on open data lake formats.

Java
1039
9 小时前

【2025最新版】 大数据 数据分析 电商系统 实时数仓 离线数仓 数据湖 建设方案及实战代码,涉及组件 #flink #paimon #doris #seatunnel #dolphinscheduler #datart #dinky #hudi #iceberg。

Java
931
6 天前

数据建设与大数据技术知识体系,包含hadoop、hive、spark、flink主流框架和系列框架,数据中台、数据湖、数据治理、数仓建设、数据化转型等

Java
414
12 天前

The native Rust implementation for Apache Hudi, with C++ & Python API bindings.

Rust
247
1 天前

Streaming application development and management system, based on Linkis and DSS, planning to provide the workflow-like graphical drag-and-drop development capability.

Java
109
4 个月前

汇总Apache Hudi中的一些Demo,便于快速上手Apache Hudi(Apache Hudi Demos to help beginners know about Hudi)

Java
75
5 年前

Playground for Lakehouse (Iceberg, Hudi, Spark, Flink, Trino, DBT, Airflow, Kafka, Debezium CDC)

Kotlin
60
2 年前

Jupyter notebooks and AWS CloudFormation template to show how Hudi, Iceberg, and Delta Lake work

Jupyter Notebook
47
3 年前

dbt (data build tool) projects targeting AWS analytics services (redshift, glue, emr, athena) and open table formats

HCL
30
2 年前

Consumption and writing to Hudi based on multiple topic

Java
8
5 年前