datalakehouse
Open Control Plane for Tables in Data Lakehouse
Lakevision is a tool that provides insights into your Apache Iceberg-based Data Lakehouse.
The complete source code repository for "바로 쓰는 오라클 클라우드 - Build and Deploy Modern Apps with Oracle Cloud".
dbt project for the Data Lake of the Secretaria Municipal de Saúde (Municipal Health Department)
Connecting PrestoDB with external data sources such as MongoDB, Elasticsearch, MySQL, and Hadoop to manipulate big data
My eleventh project, in which I create a data lakehouse using distributed computing on Databricks
This repository provides a modular and easy-to-extend ETL pipeline that streams data from a PostgreSQL database into a StarRocks data warehouse using RisingWave as the real-time streaming computation layer.
A prototype for data lake catalog management that relies only on arbitrary file systems
This project serves as a personal lab for developing and honing skills in distributed data processing and data lake architecture.
Sales Data Lakehouse Pipeline using Azure & Databricks
A scalable and optimized data warehouse solution designed for efficient data integration, transformation, and analytics. This project demonstrates ETL workflows, dimensional modeling, and query performance tuning using modern data warehousing practices.
Building a modern data warehouse with PostgreSQL, including ETL processes, data modeling, and analytics.
This repo runs a quick demo of how to spin up an Apache Iceberg application.
We will create a sample lakehouse using Docker, execute an ETL process with Spark, and then access the data in the Iceberg table format from the Nessie Catalog.
StreamFlake: Real-Time CDC Pipeline with Kafka and Snowflake