Repository navigation

#

lakehouse

prestodb/presto

The official home of the Presto distributed SQL query engine for big data

Java
16460
4 小时前
Groovy
14114
7 小时前

The world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly any scenario, StarRocks provides best-in-class performance for multi-dimensional analytics, real-time analytics, and ad-hoc queries. A Linux Foundation project.

Java
10504
4 小时前
databendlabs/databend

𝗔𝗜-𝗡𝗮𝘁𝗶𝘃𝗲 𝗗𝗮𝘁𝗮 𝗪𝗮𝗿𝗲𝗵𝗼𝘂𝘀𝗲. Open-source Snowflake alternative. Proven at petabyte scale with enterprise performance. Built for multimodal analytics. https://databend.com

Rust
8739
2 小时前

LakeSoul is an end-to-end, realtime and cloud native Lakehouse framework with fast data ingestion, concurrent update and incremental data analytics on cloud storages for both BI and AI applications.

Java
2924
5 天前
C++
2209
5 个月前

YTsaurus is a scalable and fault-tolerant open-source big data platform.

C++
2070
25 分钟前

World's most powerful open data catalog for building a high-performance, geo-distributed and federated metadata lake.

Java
1771
8 小时前
Mooncake-Labs/pg_mooncake
Rust
1622
1 小时前

Apache Fluss is a streaming storage built for real-time analytics.

Java
1398
7 小时前

Fastest open-source tool for replicating Databases to Data Lake in Open Table Formats like Apache Iceberg. ⚡ Efficient, quick and scalable data ingestion for real-time analytics. Supporting Postgres, MongoDB , MySQL and Oracle

Go
1009
6 小时前

Lakekeeper is an Apache-Licensed, secure, fast and easy to use Apache Iceberg REST Catalog written in Rust.

Rust
858
5 小时前

The Control Plane for Apache Iceberg.

TypeScript
334
11 天前

GigAPI is a Timeseries lakehouse for real-time data and sub-second queries, powered by DuckDB OLAP + Parquet Query Engine, Compactor w/ Cloud-Native Storage. Drop-in FDAP alternative ⭐

Go
306
5 小时前

Examples of using Terraform to deploy Databricks resources

HCL
277
2 天前