Repository navigation

#

spark-sql

Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.

Python
27677
12 天前

Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.

Scala
2232
5 天前
C#
2075
2 个月前

Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.

Scala
1419
5 小时前

This is the github repo for Learning Spark: Lightning-Fast Data Analytics [2nd Edition]

Scala
1335
7 个月前

电商用户行为分析大数据平台

Java
1053
3 年前
Jupyter Notebook
440
3 年前

Data Accelerator for Apache Spark simplifies onboarding to Streaming of Big Data. It offers a rich, easy to use experience to help with creation, editing and management of Spark jobs on Azure HDInsights or Databricks while enabling the full power of the Spark engine.

C#
304
5 个月前

Apache Spark™ and Scala Workshops

HTML
263
1 年前

Qbeast-spark: DataSource enabling multi-dimensional indexing and efficient data sampling. Big Data, free from the unnecessary!

Scala
233
7 个月前

A complete example of a big data application using : Kubernetes (kops/aws), Apache Spark SQL/Streaming/MLib, Apache Flink, Scala, Python, Apache Kafka, Apache Hbase, Apache Parquet, Apache Avro, Apache Storm, Twitter Api, MongoDB, NodeJS, Angular, GraphQL

TypeScript
209
7 年前

A prototype project of big data platform, the source codes of the book Big Data Platform Architecture and Prototype

Java
199
5 年前

Spark Structured Streaming / Kafka / Cassandra / Elastic

Scala
183
3 年前

Spark SQL 实现 ItemCF,UserCF,Swing,推荐系统,推荐算法,协同过滤

Scala
140
6 年前