delta-lake
Apache Doris is an easy-to-use, high-performance, unified analytics database.
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
The world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly any scenario, StarRocks provides best-in-class performance for multi-dimensional analytics, real-time analytics, and ad-hoc queries. A Linux Foundation project.
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
Create full-fledged APIs for slowly moving datasets without writing a single line of code.
A native Rust library for Delta Lake, with bindings into Python
Postgres Data Warehouse, built on Iceberg
This is the GitHub repo for Learning Spark: Lightning-Fast Data Analytics [2nd Edition]
Apache XTable (incubating) is a cross-table converter for lakehouse table formats that facilitates interoperability across data processing systems and query engines.
An open protocol for secure data sharing
Python framework for building efficient data pipelines. It promotes modularity and collaboration, enabling the creation of complex pipelines from simple, reusable components.
Analytical database for data-driven Web applications 🪶
The Lakehouse Engine is a configuration-driven Spark framework, written in Python, that serves as a scalable, distributed engine for several lakehouse algorithms, data flows, and utilities for Data Products.
Sample project to demonstrate data engineering best practices
Real-time Data Warehouse with Apache Flink & Apache Kafka & Apache Hudi
A minimalistic Rust implementation of a Delta Sharing server.
This repository demonstrates a simple ELT process that uses Delta Lake to perform upserts and to remove data files that are no longer part of the latest state of the table's transaction log.
Streaming data changes to a Data Lake with Debezium and Delta Lake pipeline