Repository navigation

#

lakehouse

prestodb/presto

The official home of the Presto distributed SQL query engine for big data

Java
16522
21 小时前
Java
14371
5 小时前

The world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly any scenario, StarRocks provides best-in-class performance for multi-dimensional analytics, real-time analytics, and ad-hoc queries. A Linux Foundation project.

Java
10727
18 小时前
databendlabs/databend

𝗔𝗜-𝗡𝗮𝘁𝗶𝘃𝗲 𝗗𝗮𝘁𝗮 𝗪𝗮𝗿𝗲𝗵𝗼𝘂𝘀𝗲. Open-source Snowflake alternative. Proven at petabyte scale with enterprise performance. Built for multimodal analytics. https://databend.com

Rust
8900
16 小时前

LakeSoul is an end-to-end, realtime and cloud native Lakehouse framework with fast data ingestion, concurrent update and incremental data analytics on cloud storages for both BI and AI applications.

Java
3016
5 天前
C++
2215
6 个月前

YTsaurus is a scalable and fault-tolerant open-source big data platform.

C++
2082
2 小时前

World's most powerful open data catalog for building a high-performance, geo-distributed and federated metadata lake.

Java
2051
1 天前

Apache Fluss is a streaming storage built for real-time analytics.

Java
1484
4 天前

Fastest open-source tool for replicating Databases to Data Lake in Open Table Formats like Apache Iceberg. ⚡ Efficient, quick and scalable data ingestion for real-time analytics. Supporting Postgres, MongoDB , MySQL and Oracle

Go
1137
8 小时前

Lakekeeper is an Apache-Licensed, secure, fast and easy to use Apache Iceberg REST Catalog written in Rust.

Rust
922
3 小时前

The Control Plane for Apache Iceberg.

TypeScript
359
11 天前

GigAPI is a Timeseries lakehouse for real-time data and sub-second queries, powered by DuckDB OLAP + Parquet Query Engine, Compactor w/ Cloud-Native Storage. Drop-in FDAP alternative ⭐

Go
346
13 天前

Examples of using Terraform to deploy Databricks resources

HCL
286
5 天前