Repository navigation

#

iceberg

Java
13532
14 小时前

Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)

Java
11169
6 小时前

The world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly any scenario, StarRocks provides best-in-class performance for multi-dimensional analytics, real-time analytics, and ad-hoc queries. A Linux Foundation project.

Java
9858
2 天前
Java
7227
10 小时前
alldatacenter/alldata

🔥🔥 AllData可定义数据中台,以数据平台为底座,以数据中台为桥梁,以机器学习平台为工厂,以大模型应用为上游产品,提供全链路数字化解决方案。采购商业版、加入技术社区:https://docs.qq.com/doc/DVHlkSEtvVXVCdEFo

Java
2695
1 个月前
cocopon/iceberg.vim
Vim Script
2284
5 个月前

High-performance, low-footprint SQL database written in C++. Process millions of rows per second from Kafka/Pulsar, Iceberg, or ClickHouse, and seamlessly write results back. Supports powerful features like JOIN, CDC, UPSERT, and LOOKUP, enabling real-time analytics and ETL at scale.

C++
1770
19 小时前
apache/polaris

Apache Polaris, the interoperable, open source catalog for Apache Iceberg

Java
1435
1 天前

Single-binary Postgres read replica optimized for analytics

Go
1351
3 天前
projectnessie/nessie

Nessie: Transactional Catalog for Data Lakes with Git-like semantics

Java
1182
3 天前

【2025最新版】 大数据 数据分析 电商系统 实时数仓 离线数仓 数据湖 建设方案及实战代码,涉及组件 #flink #paimon #doris #seatunnel #dolphinscheduler #datart #dinky #hudi #iceberg。

Java
784
10 天前
Python
679
2 天前

Lakekeeper is an Apache-Licensed, secure, fast and easy to use Apache Iceberg REST Catalog written in Rust.

Rust
580
2 天前
Jupyter Notebook
431
2 年前

Open Control Plane for Tables in Data Lakehouse

Java
340
10 天前

Serverless HTAP cloud data platform powered by Arrow × DuckDB × Iceberg

327
2 年前

The athena adapter plugin for dbt (https://getdbt.com)

Python
247
2 个月前