Repository navigation

#

bigdata

This is a repo with links to everything you'd ever want to learn about data engineering

Jupyter Notebook
27505
7 天前
juicedata/juicefs

JuiceFS is a distributed POSIX file system built on top of Redis and S3.

Go
11498
1 天前

专注大数据学习面试,大数据成神之路开启。Flink/Spark/Hadoop/Hbase/Hive...

10055
2 年前

Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second 🚀

Python
8367
6 个月前
databendlabs/databend

𝗗𝗮𝘁𝗮, 𝗔𝗻𝗮𝗹𝘆𝘁𝗶𝗰𝘀 & 𝗔𝗜. Modern alternative to Snowflake. Cost-effective and simple for massive-scale analytics. https://databend.com

Rust
8352
1 天前
Go
4588
1 天前

A data integration framework

Java
4043
1 个月前

100+套大数据可视化炫酷大屏Html5模板;包含行业:社区、物业、政务、交通、金融银行等,全网最新、最多,最全、最酷、最炫大数据可视化模板。陆续更新中

JavaScript
3918
9 个月前

🔨 用 JSON 来生成结构化的 SQL 语句,基于 Vue3 + TypeScript + Vite + Ant Design + MonacoEditor 实现,项目简单(重逻辑轻页面)、适合练手~

Vue
3450
1 年前

Apache Avro is a data serialization system.

Java
3051
11 小时前

大数据学习,从零开始学习大数据,包含大数据学习各阶段学习视频、面试资料

Java
2878
4 天前

Python clone of Spark, a MapReduce alike framework in Python

Python
2682
4 年前

GridDB is a next-generation open source database that makes time series IoT and big data fast,and easy.

C++
2419
3 个月前
C#
2049
5 天前

基于开源的flink,对其实时sql进行扩展;主要实现了流与维表的join,支持原生flink SQL所有的语法

Java
2045
1 年前