Repository navigation

#

lakehouse

prestodb/presto

The official home of the Presto distributed SQL query engine for big data

Java
16306
5 小时前
Java
13524
2 小时前

The world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly any scenario, StarRocks provides best-in-class performance for multi-dimensional analytics, real-time analytics, and ad-hoc queries. A Linux Foundation project.

Java
9853
9 小时前
databendlabs/databend

𝗗𝗮𝘁𝗮, 𝗔𝗻𝗮𝗹𝘆𝘁𝗶𝗰𝘀 & 𝗔𝗜. Modern alternative to Snowflake. Cost-effective and simple for massive-scale analytics. https://databend.com

Rust
8353
6 小时前

LakeSoul is an end-to-end, realtime and cloud native Lakehouse framework with fast data ingestion, concurrent update and incremental data analytics on cloud storages for both BI and AI applications.

Java
2704
2 天前
C++
2174
1 个月前

YTsaurus is a scalable and fault-tolerant open-source big data platform.

C++
2011
2 分钟前

World's most powerful open data catalog for building a high-performance, geo-distributed and federated metadata lake.

Java
1446
21 小时前
Mooncake-Labs/pg_mooncake
C++
1297
1 小时前

Apache Amoro (incubating) is a Lakehouse management system built on open data lake formats.

Java
950
1 天前

Fastest open-source tool for replicating Databases to Data Lake in Open Table Formats like Apache Iceberg. ⚡ Efficient, quick and scalable data ingestion for real-time analytics. Supporting Postgres, MongoDB and MySQL

Go
811
17 小时前

Lakekeeper is an Apache-Licensed, secure, fast and easy to use Apache Iceberg REST Catalog written in Rust.

Rust
579
10 小时前

Examples of using Terraform to deploy Databricks resources

HCL
250
8 天前

The Lakehouse Engine is a configuration driven Spark framework, written in Python, serving as a scalable and distributed engine for several lakehouse algorithms, data flows and utilities for Data Products.

Python
244
2 个月前

A modern data marketplace that makes collaboration among diverse users (like business, analysts and engineers) easier, increasing efficiency and agility in data projects on AWS.

Python
240
2 天前