Repository navigation

#

bigdata

This is a repo with links to everything you'd ever want to learn about data engineering

Jupyter Notebook
38059
10 天前
juicedata/juicefs

JuiceFS is a distributed POSIX file system built on top of Redis and S3.

Go
12203
6 天前

专注大数据学习面试,大数据成神之路开启。Flink/Spark/Hadoop/Hbase/Hive...

10292
2 年前
databendlabs/databend

𝗔𝗜-𝗡𝗮𝘁𝗶𝘃𝗲 𝗗𝗮𝘁𝗮 𝗪𝗮𝗿𝗲𝗵𝗼𝘂𝘀𝗲. Open-source Snowflake alternative. Proven at petabyte scale with enterprise performance. Built for multimodal analytics. https://databend.com

Rust
8900
17 小时前

🚀 High-performance distributed object storage for MinIO alternative.

Rust
8727
1 天前

Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second 🚀

Python
8433
4 天前
Java
5968
13 小时前
Go
4989
6 天前

100+套大数据可视化炫酷大屏Html5模板;包含行业:社区、物业、政务、交通、金融银行等,全网最新、最多,最全、最酷、最炫大数据可视化模板。陆续更新中

JavaScript
4502
2 个月前

A data integration framework

Java
4085
7 个月前

🔨 用 JSON 来生成结构化的 SQL 语句,基于 Vue3 + TypeScript + Vite + Ant Design + MonacoEditor 实现,项目简单(重逻辑轻页面)、适合练手~

Vue
3460
2 年前

Apache Avro is a data serialization system.

Java
3154
6 天前

大数据学习,从零开始学习大数据,包含大数据学习各阶段学习视频、面试资料

Java
3062
4 个月前

Python clone of Spark, a MapReduce alike framework in Python

Python
2680
5 年前

GridDB is a next-generation open source database that makes time series IoT and big data fast,and easy.

C++
2458
4 个月前
C#
2078
11 天前